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Isolation and characterization of a N. crassa silencing gene 

AND USES THEREOF 



5 The present invention relates to the isolation and 

characterization of a Neurospora crassa gene encoding for 
an essential activity in the co-suppression process and 
to uses and applications thereof in vegetal, animal and 
fungine fields. 

10 The production of transgenic organisms is of large 

utility both in basic and applied biological research. 
The transgenic DNA is usually integrated in the genome 
and transferred as a Mendelian character. However, in 
various instances, the transgene introduction induces 

15 gene silencing phenomena (Flavell, R.B. 1994), i.e. the 
repression of the expression of the transgene itself 
and/or of one or more endogenous homologous genes. 

The gene silencing can act at two levels: 
transcriptional ( trans-inactivation) where transgenes 

20 contain sequences homologous to the silenced gene 
promoter (Vaucheret, 1993); and post-transcriptional (co- 
suppression) which requires homologies between coding 
regions (Flavell, 1994; Stam et al., 1997; Baulcombe, 
1996) . 

25 Generally the silencing induced by a transgene 

requires an almost complete sequence homology (from 70% 
to 100%) between transgene and silenced gene sequences 
(Elkind, 1990) . 

In the Neurospora crassa filamentous fungus, during 

30 the vegetative phase, the presence of transgenes induces 
a post-transcriptional gene silencing phenomenon, named 
"quelling" (Cogoni et al., 1996). 
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By using the al-1 gene (albino 1) (Schmidhauser et 
al., 1990) as silencing visual marker, many features of 
the phenomenon have been discovered (Cogoni et al., 
1996) . Particularly the al-1 gene "quelling" in 
5 Neurospora is characterized in that: 1) the gene 
silencing is reversible further to the loss of transgene 
copies; 2) the reduction of mRNA basal level results from 
a post-transcriptional effect; 3) transgenes containing 
at least a region of 132 base pairs which is identical to 

10 the region encoding for the target gene are sufficient to 
induce the "quelling"; 4) the duplication of promoter 
sequences is ineffective to induce the silencing; 5) the 
"quelling" exhibits a dominant behavior in eterocarions 
containing both transgenic and untransf ormed nuclei, 

15 indicating the involvement of a molecule which is acts 
"in trans" among the nuclei; 6) the expression of an 
aberrant RNA transcribed by the transgenic locus is 
strictly correlated to silencing, suggesting that the 
"quelling" can be induced and/or mediated by a transgenic 

20 RNA molecule. 

Therefore homologies between Neurospora silencing 
and plant co-suppression can be pointed out. The gene 
silencing in Neurospora is reversible, as result of 
transgenic copies instability during mitotic phase; in 

25 plants also the co-suppression reversion is associated 
with the reduction of transgene copy number, resulting 
from intra-chromosomal recombination during mitosis or 
meiosis (Mittelstein Scheid et al., 1994; Stam et al., 
1998). Thus both in plants and in Neurospora the 

30 transgene presence is required to maintain the silencing. 
As in Neurospora, a decrease of the mRNA basal level of 
the silenced gene results from a post-transcriptional 
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mechanism (Dehio and Schell 1994; van Blokand et al., 
1994; de Carvalho et al., 1995). Furthermore to induce 
the "quelling", transgenes must contain a portion of the 
silencing target gene coding sequence, being the promoter 
5 region ineffective. In plants coding regions with no 
promoter sequences can induce silencing (van Blokand et 
al., 1994) and, as in the "quelling", promoters or 
functionally active gene products are not required for 
the co-suppression. 

10 One of the similarities between "quelling" and co- 

suppression in plants is that both mechanisms are 
mediated by diffusion factors. In Neurospora 
eterokaryotic strains, nuclei wherein the albino-1 gene 
is silenced are able to induce the al-1 gene silencing of 

15 the other not transformed nuclei, all sharing the same 
cytoplasmic environment (Cogoni et al., 1996). In plants 
the presence of a diffusion factor results from the fact 
that the co-suppression is effective in inhibiting the 
replication of Tobacco Etch Virus (TEV) , a RNA virus with 

20 an exclusively cytoplasmic cycle. The occurrence of 
highly diffusible factors, which are effective to mediate 
the co-suppression, has been demonstrated using the 
grafting technique in tobacco (Palaqui et al., 1997), 
showing that silenced tobacco plants are able to transfer 

25 the silencing to non-silenced plants through grafting. 

The fact that "quelling" and co-suppression share 
all these features suggests that mechanisms involved in 
post-transcriptional gene silencing in plants and in 
fungi can be evolved by an ancestral common mechanism. 

30 Recently gene inactivation phenomena resulting from 

transgene introduction have been disclosed in animals. In 
Drosophila melanogaster the location of a transgene close 
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to heterochromatic centers results in a variegate 
expression (Wallrath and Elgin, 1995; Pirrotta, V., 
1997) . Similar expression profiles have been observed 
when the reference transgene is within tandem arrayed 
5 transposons, indicating that tandem repeats are effective 
to induce the chromatin condensation. (Dorer. and 
Henikoff, 1994). Again in Drosophila Pal-Bhadra et al. 
(1997) have observed that the transgene introduction can 
lead to gene inactivation phenomena, similar to the co- 
10 suppression. 

Gene silencing phenomena resulting from transegene 
sequence repeats have been disclosed recently in 
mammalians. 

Garrick et al. (1998) produced mouse transgenic 

15 lines wherein 100 transgenic copies are present only in a 
locus and are directly tandem arrayed. The transgene 
expression has been disclosed to be inversely 
proportional to the number of occurring copies, 
indicating that silencing phenomena dependent on repeat 

20 copies are present also in mammalians. 

Therefore the identification of Neurospora genes 
which are involved in the silencing is the first step to 
modulate the same process in plants, animals and fungi. 
The silencing modulation is of great relevance when 

25 transgenic organisms able to express the desired 
phenotype are produced. 

The authors of the present invention have already 
isolated Neurospora crassa strains having mutations 
regarding essential functions for gene silencing 

30 mechanism (Cogoni and Macino, 1997); 15 independent 
isolated mutants define three complementation groups, 
thus identifying the gde-1, qde-2 and qde-3 genes (qde 
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stands for xx quelling"-def icient ) , whose products are 
essential to the silencing machinery. qde genes are 
essential to the Neurospora silencing, as suggested by 
the fact that silencing of three independent genes (al-l, 
5 al-2 and qa-2) is impaired by qde mutations (Cogoni and 
Macino, 1997) . 

The authors of the invention have identified and 
cloned now one out of Neurospora qde genes, thus 
identifying one of required factors for silencing. By 
10 considering the similarity between "quelling" and co- 
suppression, genes orthologous to the isolated gene are 
involved in co-suppression and more generally in gene 
silencing in other organisms, like plants, fungi and 
animals . 

15 The present invention can be applied with reference 

to two general scope: 1) silencing potentiation as a tool 
for inactivating more effectively and durably a desired 
gene, and 2) silencing suppression to obtain a better 
expression of the introduced transgenes. 

20 As to the silencing potentiation, the over- 

expression of one or more genes controlling the 
phenomenon can lead to higher efficiency and/or stability 
thereof. Therefore the introduction of qde-3 gene or of 
homologous genes thereof in microorganisms can constitute 

25 a tool to repress more effectively gene functions. 
Particularly this approach is specially useful in plants 
wherein the co-suppression is usually used for the 
"knock-out" of gene functions. In plants again the gene 
silencing potentiation can be used to obtain lines 

30 resistant to pathogen virus, by introducing transgenes 
encoding for viral sequences, in order to achieve the 
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expression inhibition of the virus itself (Flavell et 
al., 1994) . 

Analogous applications are suitable for animals, 
wherein some indications suggest that silencing can 
5 inhibit the suitable expression of introduced transgenes 
(Garrick et al., 1998). 

On the contrary, there are instances wherein it is 
desirable not to have or to reduce the gene silencing, 
i.e. where a transgene is to be over-expressed. It is 

10 known that the co-suppression is strictly correlated both 
with the presence of an high copy number of the 
transgene, and with a transgene high expression. This 
correlation can hamper the production of transgenic 
organisms which express a transgene at high levels, 

15 because more high is the expression and/or the copy 
number, more probable is to evoke silencing responses. As 
above mentioned, analogous mechanisms of gene 
inactivation, dependent on a high copy number, have been 
disclosed in animals. In these circumstances plant or 

20 animal lines, totally or partially ineffective for 
silencing, constitute an ideal recipient wherein the 
desired gene can be over-expressed. The invention can be 
applied within this scope using different approaches: 

A) Identification and production of mutant lines in 

25 genes homologous to qde-3 gene, in plants, animals and 
fungi. 

The knowledge of Neurospora qde-3 gene, essential 
for silencing mechanism, can allow the isolation of 
mutant lines in other organisms, mutated in genes 
30 homologous to qde-3. For example by means of 
amplif ications using degenerated primers, designed from 
the most conserved regions of qde-3 gene, mutant lines in 
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homologous genes can be identified, by analysis of 
insertion mutant gene banks, already available for many 
plant species. Both in fungi and animals such mutants can 
be obtained, following the identification of the 
5 homologous gene, by means of "gene disruption" techniques 
using homologous recombination. 

B) Reduction of qde-3 gene expression 
Other strategies for the production of silencing- 
deficient lines comprise the use of Neurospora qde-3 gene 

10 or homologous genes thereof, qde-3 or homologous genes 
can be introduced into suitable expression vectors to 
express them in an anti-sense orientation in order to 
inhibit the expression of resident endogenous genes. 
Alternatively portions of qde-3 or of homologous genes 

15 can be over-expressed, in order to obtain a negative 
dominant effect and thus blocking the function of qde-3 
endogenous genes. 

The authors of the present invention have cloned 
and characterised the Neurospora crassa qde-3 gene. The 

20 sequence analysis showed that qde-3 gene belongs to a 
highly conserved gene family, from E. coli to humans, 
named recQ. Genes belonging to this family encode for DNA 
helicase, as demonstrated by in vitro assays (Gray et 
al., 1997). The recQ helicase family is involved in 

25 recombinant processes. Mutations of these genes produce 
iper-recombinant phenotypes as, for example, the S. 
cerevisiae . Sgs-1 gene involved both in meiotic and 
mitotic recombination. 

The authors of the invention for the first time 

30 have demonstrated that a gene encoding for a recQ DNA- 
helicase is involved in gene silencing induced by 
transgenes. Therefore, for the first time it is disclosed 



that a gene belonging to the recQ family, other than acts 
during recombination, is also an essential component of 
the inactivation of repeat sequences. 

Therefore it is an object of the invention a 
5 nucleotide sequence encoding for a protein characterized 
in having a silencing activity and comprising a recQ 
helicase domain, wherein the domain is at least 30% 
homologous with the amino acid sequence from aa. 8 97 to 
aa. 1330 of SEQ ID No.l. More preferably said homology is 

10 of at least of 60%. Most preferably the recQ helicase 
domain comprises the amino acid sequence from aa. 897 to 
aa. 1330 of SEQ ID No.l. According to a particular 
embodiment the nucleotide sequence encodes for a protein 
having the amino acid sequence of SEQ ID No. 1, or 

15 functional portions thereof. Even more preferably the 
nucleotide sequence of the invention is the sequence of 
SEQ ID No. 1 or its complementary sequence. 

A further object of the invention is an expression 
vector comprising, under the control of a promoter that 

20 is expressed in bacteria, the nucleotide sequence of the 
invention. Those skilled in the art will appreciate that 
any plasmid suitable for a correct and effective 
expression of the protein of the invention in bacteria 
can be used and is within the scope of the invention. 

25 A further object of the invention is an expression 

vector comprising, under the control of a promoter which 
is expressed in plants or in specific plant organs, the 
nucleotide sequence of the invention, both in a sense and 
anti-sense orientation. Those skilled in the art will 

30 appreciate that any plasmid suitable for a correct and 
effective expression of the protein of the invention in 
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plants or in specific plant organs can be used and is 
within the scope of the invention. 

A further object of the invention is an expression 
vector comprising, under the control of a promoter which 
5 is expressed in fungi or in portions thereof, the 
nucleotide sequence of the invention, both in a sense and 
anti-sense orientation. Those skilled in the art will 
appreciate that any plasmid suitable for a correct and 
effective expression of the protein of the invention in 

10 fungi or in portions thereof can be used and is within 
the scope of the invention. 

A further object of the invention is an expression 
vector comprising, under the control of a promoter that 
is expressed in animals, the nucleotide sequence of the 

15 invention both in a sense and anti-sense orientation. 
Those skilled in the art will appreciate that any plasmid 
suitable for a correct and effective expression of the 
protein of the invention in animals can be used and is 
within the scope of the invention. 

20 A further object of the invention is a prokaryotic 

organism transformed by using the expression vector 
active in bacteria of the invention. 

A further object of the invention is a plant or a 
specific plant organ transformed by using the expression 

25 vector active in plants of the invention. 

A further object of the invention is a plant 
mutated at the nucleotide sequence of the invention and 
having a reduced or inhibited silencing activity. 

A further object of the invention is a fungus 

30 transformed with the expression vector of the invention 
active in fungi . 
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A further object of the invention is a fungus 
mutated at the nucleotide sequence of the invention and 
having a reduced or inhibited silencing activity. 

A further object of the invention is a non-human 
5 animal transformed with the expression vector of the 
invention active in animals. 

A further object of the invention is a non-human 
animal mutated at the nucleotide sequence of the 
invention and having a reduced or inhibited silencing 
10 activity- 

A further object of the invention refers to a 
protein characterized in having a silencing activity and 
in comprising a recQ helicase domain , wherein the domain 
is at least 30% homologous to the amino acid sequence 
15 from aa. 8 97 to aa. 1330 of SEQ ID No.l. Preferably the 
recQ helicase domain is at least 4 0% homologous with the' 
amino acid sequence from aa. 8 97 to aa. 1330 of SEQ ID 
No.l. More preferably the recQ helicase domain is at 
least 60% homologous with the amino acid sequence from 
20 aa. 897 to aa. 1330 of SEQ ID No.l. Most preferably the 
recQ helicase domain comprises the amino acid sequence 
from aa. 897 to aa. 1330 of SEQ ID No.l. According to a 
particular embodiment the protein comprises the amino 
acid sequence of SEQ ID. No.l or functional portions 
25 thereof. 

It is within the scope of the invention the use of 
the nucleotide sequence of the invention to modulate gene 
silencing in plants , animals and fungi. 

It is within the scope of the invention the use of 
30 the nucleotide sequence of the invention to potentiate 
the antiviral-response in a plant. 
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The present invention now will be disclosed by way 
of non limiting examples with reference to the following 
figures: 

Figure 1: Southern blot analysis of genomic DNA 
extracted from (A) : untransf ormed wild type strain, (B) : 
6xw recipient strain and (C) : untransf ormed wild type 
strain, Smal and Hindlll digested, blotted and al-1 gene 
probe hybridized. The 3. 1-Kb band corresponds to the 
endogenous al-1 gene, while the 5.5-Kb band corresponds 
to tandem arrayed al-1 transgenes. The larger band 
represents undigested methylated DNA. 

Figure 2: Linear map of the pMXY2 plasmid. Plasmid 
genes are shown as box. bml : beta-tubulin allele which is 
responsible for benilate resistance; Amp: ampicillin 
resistance; qa-2 P: qa-2 gene promoter; TrpC T: trpC gene 
terminator. SphI and Bglll are restriction sites used for 
the plasmid recovery from the 627 mutant chromosomal DNA. 

Figure 3: Schematic representation of pQD6 and pQ35 
plasmids. Restriction sites (Bglll for pQD6 and SphI for 
pQ35) used for the recovery of the chromosomal DNA of the 
627 strain are reported. Chromosomal sequences, flanking 
the integration site, are represented as segments. 
Restriction sites used to isolate DNA fragments used for 
probing the gene library are also represented. 

Figure 4: Nucleotide sequence of the 6.9-Kb 
fragment containing the gde-3 gene and flanking 
sequences. The amino acid sequence is shown above the 
nucleotide sequence. The bold sequences represent two 
introns of 98 and 68 nt. In these regions the underlined 
nucleotides identify consensus sequences of the donor 
site, the acceptor site and the internal sequence or 
lariat. It is also represented the pMXY2 plasmid 
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insertion site, in the 627 mutant, used for insertional 
mutagenesis. The portion encoding for the helicase domain 
is underlined. 

Figure 5: Nucleotide sequence (SEQ ID No. 1) of the 
encoding portion reported in Figure 4 and deduced amino 
acid sequence. Amino acids from 8 97 to 1330, which define 
the recQ DNA-helicase domain, are underlined. 

Figure 6: Multiple alignment, at the conserved 
domains, among qde-3 and other proteins belonging to recQ 
family, arab recQ: A. thaliana isologous; E. coli recQ; 
S. pombe hus-2; S.- cerevisiae sgs-1; human wrn: Werner 
syndrome; human blm: Bloom syndrome. Identical amino 
acids are shown in bold. 
MATERIALS AND METHODS 
E. coli strains 

E. coli strain HB101 (F~, hsdS20(rb", mb") , supE44, 
recA13, aral4, proA2, rspL20 (str r ) , xyl-5) was used for 
cloning. 

Neurospora crassa strains and growing conditions 

Neurospora crassa following strains, supplied by 
Fungal Genetic Stock Center (FGSC, Dpt. Of Microbiology, 
University of Kansas Medical Ctr. Kansas City, KA) were 
used: 

- Wild type (FGSC 987) ; 

- qa-2/aro9 (FGSC 3957A) , (FGSC 3958a). 

The 6XW strain (Cogoni et al., 1996) was obtained 
upon transformation of the FGCS 3958a strain with pX16 
(Cogoni et al., 1996). This plasmid contains the qa-2 
gene used as selective marker and the al-l coding 
sequence . 
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The mutated strains M7, M20 (qde-1) ; M10, Mil (qde- 
2); M17, M18 (gde-3) are described in Cogoni and Macino, 
1997. 

The qde mutants were obtained by UV mutagenesis. As 
5 recipient the transforming strain (6xw) silenced at the 
albino-1 gene was used, qde mutants were selected for 
their ability to recover a wild type unsilenced phenotype 
and then classified in three different complementation 
groups. By analyzing the al-2 gene quelling frequency all 
10 of qde used mutants are defective for the general 
silencing mechanism. 

Complementation assays with not forced 
heterocaryons were carried out according to Davis and 
DeSerres, 1970. 
15 Plasmids and libraries 

The plasmid pMXY2, disclosed in Campbell et al., 
used for insertional mutagenesis was obtained from FGSC. 
The plasmid contains the Bml gene (allele responsible of 
the benilate drug resistance) , that was used as selective 
20 marker after transformation. The genomic DNA containing 
the qde-2 gene was isolated from a N. Crassa gene library 
in cosmids. (Cabibbo et al., 1991). 
N. crassa transformation 

Spheroplasts were prepared according to the Akins 
25 and Lambowitz (1985) protocol. 
Southern Blot Analysis 

Chromosomal DNA was prepared as disclosed by Irelan 

et al., 1993. 5 Jig of genomic DNA were digested and 
blotted as reported in Maniatis et al. 
30 DNA probes were: a) as to the al-1 gene the probe 

is represented by a Xbal-Clal restriction fragment of 
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pX16 (Cogoni et al., 1996); b) as to the Bml gene the 
probe is represented by the 2,6Kb Sail fragment of pMXY2 . 
Northern Blot Analysis 

N. crassa total RNA was extracted according to the 
5 protocol described by Cogoni et al., 1996. The mycelium 
was grown for two days at 30°C / then powdered in liquid 
nitrogen before RNA extraction. For Northern analysis 10 
jag of RNA were formaldehyde denatured, electrophoresed on 
a 1% agarose, 7% formaldehyde gel, and blotted over 
10 Hybond N (Amersham) membranes. Hybridization was carried 
out in 50% formamide in the presence of 32 P labeled DNA 
probe 1.5xl0 6 cpm/ml. 
RESULTS 

Isolation of silencing mutant by insertional mutagenesis 

15 Neurospora strain (6XW) wherein the albino-1 

resident gene was steadily silenced was UV mutagenised, 
and qde ("quelling" deficient) mutants were isolated 
(Cogoni and Mancino 1997) . The 6XW strain shows an albino 
phenotype due to the lack of carotenoid biosynthesis, as 

20 results by the silencing of the albino 1 gene expression 
(Schmidhauser et al., 1990). A mutation interfering with 
the silencing machinery is easily detectable by producing 
a wild type phenotype (bright orange) of the carotenoid 
biosynthesis. By means of complementation assays it was 

25 possible to establish that qde mutants belong to three 
complementation groups, indicating the presence of three 
genetic loci involved in the Neurospora silencing 
mechanism. In order to isolate the qde genes an 
insertional mutagenesis was carried out with the 6XW 

30 strain, previously used for UV mutagenesis. The 
insertional mutagenesis was carried out by transforming 
the 6XW strain with a plasmid, taking advantage of the 
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fact that, after the transformation, plasmids are 
randomly inserted in the Neurospora crassa genome. The 
mutagenesis was carried out transforming the 6XW silenced 
strain with pMXY2 (see Materials and Methods) which 
5 contains the benilate resistance as selective marker. 
Transformed strains able to grow in the presence of 
benilate containing medium and showing a wild type 
phenotype for the carotenoid biosynthesis were selected. 
Out of 50.000 isolated independent transformed strains, a 

10 benilate resistant strain (627) was isolated, which 
showed the bright orange phenotype expected for a gde 
gene mutation. In order to verify that the silencing 
release was effectively due to a gde gene mutation and 
not to the loss of al-1, the genomic DNA of the strain 

15 627 was extracted and digested with Smal and Hindlll 
restriction enzymes. After blotting, DNA was hybridized 
with a probe corresponding to the coding sequence of al- 
1. The Smal site is present only once in the al-1 
transgene containing, plasmid and the digestion by using 

20 said enzyme produces a 5.5Kb fragment corresponding to 
tandem arrayed al-1 transgenes, while a 3.1Kb fragment is 
expected from the resident al-1 locus. Figure 1 shows 
that the number of al-1 transgenic copies present in the 
627 strain is comparable to that present in the silenced 

25 6XW strain. 

The 627 strain includes a mutated qde3 gene 

The 627 strain was assayed in a heterokaryon assay 
with a wild type strain and with M7, M20 (gde-2) M10, Mil 
(qde-2) mutants (Cogoni and Macino, 1997) . As shown in 

30 Table 1 the al-2 gene silencing is restored producing an 
albino phenotype in all of heterocaryons but M17 and M18. 
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This behavior is consistent with the presence of a qde-3 
gene recessive mutation in the 627 strain. 

Table 1 

5 Reciprocal heterokaryons among 627 mutant and previously 



characterized gde mutants. 
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AL 


WT 


WT 
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WT 


WT 


AL 




AL 


AL 


AL 
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WT 


AL 




AL 


AL 


AL 


M10 








WT 




WT 


AL 


AL 


Mil 












WT 


AL 


AL 


M17 














WT 


WT 


M18 
















WT 



WT = heterokaryon with a wild type phenotype for 
carotenoid; 

AL = heterokaryon with an albino phenotype wherein the 



10 al-1 gene silencing is restored. 

Recovery of sequences flanking the pMXY2 plasmid 
integration site 

In order to recover sequences flanking the 

15 integration site or sites the following methodology was 
carried out. The . 627 strain genomic DNA was restricted 
with SphI and Bglll enzymes. As shown in the map of 
Figure 2 the enzymes digest respectively upstream and 
downstream to the region containing both the ampicillin 

20 resistance gene and the origin of replication present in 
pMXY2. Subsequently the genomic DNA was ligated and the 
product used to transform E. coli cells. The screening 
was performed in an ampicillin-containing medium. pQD6 
and pQ35 plasmids were recovered from Bglll and SphI 
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restricted chromosomal DNA, respectively (see Figure 3) . 
Two DNA fragments containing sequences flanking the 
integration site were isolated by using, respectively, 
Bglll and Sail enzymes for pQD6, and SphI and Hindlll 
5 enzymes for pQ35 (Figure 3) . 

Isolation of genomic clones, their subcloning and 
complementation of the qde-3 mutant 

The two fragments from pQD6 and pQ35 plasmids were 
used to probe a Neurospora crassa genomic library in 

10 , cosmids. Cosmids 6E8 and 54D7, both containing about 30 
Kb genomic DNA inserts, were isolated. Both the probes 
recognize the same cosmids, thus indicating that the two 
flanking sequences are contiguous. Cosmids 6E8 and 54D7 
were used in transformation experiments with M17 and M18 

15 mutants. Both of cosmids are able to restore the al-1 
gene silencing in the two mutants, determining an albino 
phenotype. Furthermore the introduction of same cosmids 
into the M10 (gde-2) or the M20 mutant (gde-1) is not 
effective to restore the silencing. 

20 The 6E8 cosmid was used to subclone a 9 Kb Sphl- 

SphI fragment. This subclone was used for transformation 
experiments and resulted to be able to complement the 
qde-3 phenotype, indicating that a qde-3 functional gene 
is present in this plasmid. 

25 Isolation and sequence of the qde-3 cDNA 

The Sphl-SphI region was sequenced, like the 
corresponding cDNA, by using RT-PCR. The latter sequence 
was used to deduce the qde-3 amino acid sequence and map 
the introns therein. The qde-3 gene encodes for a 1900 

30 aa. putative protein (200 KDa) . The genomic clone 
contains two introns of 98 nt. and 68 nt., respectively. 
Intron acceptor and donor sequences were identified and 
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correspond to described consensus sequences (Figure 4). 
Furthermore the pMXY2 plasmid insertion site within the 
gene in the 627 transforming strain is indicated. The 
insertion site was deduced by analysis of pQD6 and pQ35 
plasmid sequences. 

The cDNA sequence is shown in Figure 5 (SEQ ID No, 
I), wherein the helicase domain containing 4 34 amino 
acids from 897 aa to 1330 aa is underlined. 
The qde-3 gene is belonging to recQ helicase DNA family 

The 1900 aa sequence was used to search in database 
of amino acid sequences, by using the BLASTP algorithm. 
Significant homologies were identified with 6 genes 
belonging to the reQ family, belonging to the helicase 
group containing the DEAH consensus sequence. Figure 6 
shows the homologous region sequence alignment of 
helicase domains, as defined in Figure 5, among qde-3 and 
genes belonging to recQ helicase family, qde-3 shows the 
highest homology with hus-2 (55% amino acid identity) and 
the lowest homology with IVm (40% identity) . 
Plant expression vector 

The qde-3 gene was inserted, in a sense 
orientation, into a vector containing a plant expression 
"cassette", including the 35S promoter and the PI-II 
"terminator" sequences. The vector also includes the 
Streptomyces hygroscopicus bar gene, which confers the 
phosphinotricine herbicide resistance to transformed 
plants. In an analogous vector, qde-3 was inserted in an 
anti-sense orientation with respect to the 35S promoter. 

The obtained vectors can be utilized to over- 
express the qde-3 gene in plants, or to repress the gene 
expression of resident genes, which are homologous to 
qde-3, respectively . 
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Fungus expression vector 

The qde-3 gene was inserted in a vector containing 
a fungal specific expression "cassette", comprising the 
A. nidulans trpC gene promoter and terminator, both in a 
5 sense and an anti-sense orientation. In addition the 
vector contains the bacterial hph gene, which confers the 
hygromicine drug resistance. The sense plasmid can be 
used to over express the qde-3 gene, whereas the anti- 
sense plasmid is used to repress the expression of qde-3 

10 homologous genes in various fungine species. 
Mammalian expression vector 

The qde-3 gene was inserted in a vector containing 
a mammalian specific expression "cassette", including the 
cytomegalovirus (CMV) promoter and SV4 0 termination and 

15 polyadenylation sequences both in a sense and anti-sense 
orientation. The vector includes also the neomicine 
phototransferase gene, as marker for mammalian cell 
selection. The sense plasmid can be used to over express 
the qde-3 gene, whereas the anti-sense plasmid can be 

20 used to repress the expression of qde-3 homologous genes 
in various mammalian species. 
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Claims 

1. Nucleotide sequence encoding for a protein 
characterized in having a silencing activity and in 

5 comprising a recQ helicase domain, wherein the domain is 
at least 30% homologous with the amino acid sequence from 
aa. 897 to aa. 1330 of SEQ ID No.l. 

2. Nucleotide sequence encoding for a protein 
characterized in having a silencing activity and 

10 comprising a recQ helicase domain according to claim 1, 
wherein the domain is at least 40% homologous with the 
amino acid sequence from aa. 897 to aa.. 1330 of SEQ ID 
No.l. 

3 . Nucleotide sequence encoding for a protein 
15 characterized in having a silencing activity and 

comprising a recQ helicase domain according to claim 2, 
wherein the domain is at least 60% homologous with the 
amino acid sequence from aa. 897 to aa. 1330 of SEQ ID 
No.l. 

20 4. Nucleotide sequence encoding for a protein 

characterized in having a silencing activity and 
comprising a recQ helicase domain according to claim 3, 
wherein the recQ helicase domain is the amino acid 
sequence from aa. 897 to aa. 1330 of SEQ ID No.l. 

25 5. Nucleotide sequence encoding for a protein 

characterized in having a silencing activity and 
comprising a recQ helicase domain according to claim 4, 
wherein said nucleotide sequence encodes for a protein 
having the amino acid sequence of SEQ ID No. 1, or 

30 functional portions thereof. 

6. Nucleotide sequence encoding for a protein 
characterized in having a silencing activity and 
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comprising a recQ helicase domain according to claim 5, 
wherein said nucleotide sequence is the sequence of SEQ 
ID No. 1 or its complementary sequence. 

7. Expression vector comprising, under the control 
5 of a promoter that is expressed in bacteria, the 

nucleotide sequence according to any one of claims 1-6. 

8. Expression vector comprising, under the control 
of a promoter that is expressed in plants or in specific 
plant organs, the nucleotide sequence according to any 

10 one of claims 1-6, both in a sense and anti-sense 
orientation. 

9. Expression vector comprising, under the control 
of a promoter that is expressed in fungi, the nucleotide 
sequence according to any one of claims 1-6 both in a 

15 sense and anti-sense orientation. 

10. Expression vector comprising, under the control 
of a promoter that is expressed in animals, the 
nucleotide sequence according to any one of claims 1-6 
both in a sense and anti-sense orientation. 

20 11. Prokaryotic organism transformed by using the 

expression vector active in bacteria according to claim 
7. 

12. Plants or a specific plant organ transformed by 
using the expression vector active in plants according to 

25 claim 8. 

13. Plant mutated at the nucleotide sequence 
according to any one of claims 1-6 having a reduced or 
inhibited silencing activity. 

14. Fungus transformed by using the expression 
30 vector active in fungi according to claim 9. 
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15. Fungus mutated at the nucleotide sequence 
according to any one of claims 1-6 having a reduced or 
inhibited silencing activity. 

16. Non-human animal transformed by using the 
5 expression vector active in animals according to claim 

10. 

17 . Non-human animal mutated at the nucleotide 
sequence according to any one of claims 1-6 having a 
reduced or inhibited silencing activity. 

10 18. Protein characterized in having a silencing 

activity and comprising a recQ helicase domain wherein 
the domain is at least 30% homologous to the amino acid 
sequence from aa. 8 97 to aa. 1330 of SEQ ID No.l. 

19. Protein characterized in having a silencing 
15 activity and comprising a recQ helicase domain according 

to claim 18 wherein the domain is at least 4 0% homologous 
to the amino acid sequence from aa. 8 97 to aa. 1330 of 
SEQ ID No.l. 

20. Protein characterized in having a silencing 
20 activity and comprising a recQ helicase domain according 

to claim 19 wherein the domain is at least 60% homologous 
to the amino acid sequence from aa. 8 97 to aa. 1330 of 
SEQ ID No.l. 

21. Protein characterized in having a silencing 
25 activity and comprising a recQ helicase domain according 

to claim 20 wherein the domain is the amino acid sequence 
from aa. 897 to aa. 1330 of SEQ ID No.l. 

22. Protein characterized in having a silencing 
activity and comprising a recQ helicase domain according 

30 to claim 21 comprising the amino acid sequence of SEQ ID. 
No.l or functional portions thereof. 
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23. Use of the nucleotide sequence according to any 
one of claims 1-6 to modulate the gene silencing in 
plants, animals and fungi, 

24. Use of the nucleotide sequence according to any 
one of claims 1-6 to potentiate the antiviral-response in 
a plant. 
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CCA CCA CCA CCA CCA CCA CCA CAA CAA CCA ACT CAA CAA CGA ACT CAA CAA CCA 
9 18 27 36 45 54 

ACC CCA AAC CCA ACC TCG ACC TCA ACC TCA ACC TCA ACC CTT GCG ACC TCG AGA TAC ACA 
63 72 81 90 99 108 

AAC ACA TCC TCG ACA AAT GAC GCC CGA CCC GCT ACA CGC CAA CAG ATT GCC CCC GAG GTA 
123 132 141 150 159 168 

M A 

GGA GCA TCG ACG CAT CAG GAT TCA GTT GGA CTT GGA GAA GGA GGA GGA GGA GGA ATG GCG 
183 192 201 210 219 228 

KLSVKNNLPRPHLVSLSSST 
AAG CTC TCA GTC AAG AAC AAC CTG CCA CGG CCG CAC TTG GTC TCC TTG TCG TCG TCA ACG 
243 252 261 270 279 288 

TGSGSGSA SRSASAKHGSAG 
ACA GGC TCT GGG TCT GGT TCT GCG TCT AGG TCA GCT TCT GCT AAG CAC GGA AGT GCC GGT 
303 312 321 330 339 348 

SSTFDHEQHQQHQQQQQQKR 
TCC AGT ACC TTT GAT CAT GAA CAA CAT CAA CAA CAT CAA CAA CAA CAA CAA CAA AAG CGC 
363 372 381 390 399 408 

QRSQSEARQQQQQQQQQQQQ 
CAG CGG TCG CAA TCA GAA GCA CGA CAA CAG CAG CAG CAA CAG CAA CAG CAA CAG CAA CAG 
423 432 441 450 459 468 

QQQQQQAQHHAHSTYAQRPQ 
CAA CAA CAA CAA CAA CAA GCA CAG CAC CAT GCA CAT TCT ACA TAT GCA CAA AGA CCC CAA 
483 492 501 510 519 528 

PTPQQRPP QNLLTPASTTGA 
CCC ACC CCC CAA CAA CGA CCA CCC CAA AAC CTA CTG ACA CCT GCT TCA ACC ACT GGT GCC 
543 552 561 570 579 588 

SVGPLQRAYSVSLAARQS PS 
AGC GTC GGC CCG CTC CAA CGC GCA TAC TCG GTT TCA TTA GCT GCG AGA CAG TCC CCC TCG 
603 612 621 630 639 648 

TNLVRPKTDS PAPHTLHLKN 
ACA AAC TTG GTC CGT CCA AAG ACC GAC TCG CCA GCT CCC CAC ACT TTA CAC CTC AAG AAC 
663 672 681 690 699 708 

KKNLRHPAPT PDSP IV DDDI 
AAG AAG AAC CTC CGT CAC CCC GCC CCC ACG CCC GAC AGT CCG ATC GTA GAC GAC GAT ATT 
723 732 741 750 759 768 

FSDAVDLTEELDHDHDLNGK 
TTC TCC GAC GCC GTC GAT CTT ACC GAA GAA CTC GAT CAT GAC CAT GAT CTC AAC GGC AAA 
783 792 801 810 819 828 

DKDNTDNDNTVA SSSLIGFG 
GAC AAA GAC AAC ACC GAC AAC GAC AAC ACA GTC GCT TCC AGT TCG CTA ATA GGG TTC GGC 
843 852 861 870 879 888 

DDKLLWREDFAERAE P EHER 
GAT GAC AAG TTA CTG TGG CGA GAG GAC TTT GCT GAG CGT GCA GAG CCC GAA CAT GAA AGA 
903 912 921 930 939 948 

GGSRPRQVKKRKISNDYIMK 
GGT GGG AGC AGG CCT CGC CAG GTC AAG AAA CGG AAG ATA TCG AAT GAC TAC ATT ATG AAG 
963 972 981 990 999 1008 

DE DVSLFDDDGEEDEFMDIN 
GAT GAG GAT GTC TCG CTT TTT GAT GAT GAT GGC GAG GAG GAC GAG TTT ATG GAT ATC AAT 
1023 1032 1041 1050 1059 1068 
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ELVQGDRESTPKPKAT SRSV 
GAG CTA GTT CAG GGG GAT CGG GAA AGT ACT CCG AAG CCA AAG GCT ACA TCG AGG TCT GTC 
1083 1092 1101 1110 1119 1128 

STRLPPTVSLQRGRSPKRKE 
TCG ACG AGG CTG CCG CCT ACA GTA TCG CTG CAA CGG GGT CGG TCT CCT AAG AGG AAG GAG 
1143 1152 1161 1170 1179 1188 

ASVE KRTTENQQQ ADREDE P 
GCT TCA GTT GAA AAG CGC ACA ACG GAA AAC CAG CAA CAG GCT GAC AGA GAA GAC GAA CCG 
1203 1212 1221 1230 1239 1248 

SFMSSPDVDNSRKRKSSGSP 
TCG TTT ATG TCA AGT CCA GAT GTC GAC AAC TCC CGC AAG CGA AAG TCT TCT GGA TCG CCC 
1263 1272 1281 1290 1299 1308 

TGLTT PRPQQKQTEEVPGTT 
ACA GGT TTA ACG ACG CCA AGA CCC CAG CAG AAG CAA ACG GAA GAG GTC CCA GGT ACG ACC 
1323 1332 1341 1350 1359 1368 

TAKKPRRSEVMDSEDEAFTP 
ACC GCC AAG AAG CCA CGG CGC AGT GAA GTG ATG GAC TCG GAG GAC GAG GCA TTC ACT CCT 
1383 1392 1401 1410 1419 1428 

LSAGSLPGSAEFFRSGGTTT 
CTT TCT GCT GGG TCG CTG CCT GGG AGT GCG GAG TTC TTC AGA AGC GGT GGG ACC ACC ACA 
1443 1452 1461 1470 1479 1488 

RELGLDEDTVMDTPSRPPVE 
CGG GAA TTG GGT TTG GAC GAA GAC ACG GTT ATG GAC ACG CCT AGT AGG CCA CCG GTC GAG 
1503 1512 1521 1530 1539 1548 

STLPTLESVESRPPPLPPMD 
TCC ACT TTG CCA ACT CTC GAG TCT GTG GAA AGT CGA CCA CCC CCC CTG CCG CCC ATG GAT 
1563 1572 1581 1590 1599 1608 

LPSQRKPLEPLNTPRNQLLE 
CTA CCA TCA CAG CGA AAA CCG CTA GAG CCG TTG AAC ACT CCG CGC AAC CAG CTG CTT GAG 
1623 1632 1641 1650 1659 1668 

SVERPTQQPSVGPSFAQSST 
TCG GTC GAA AGG CCA ACA CAG CAG CCG TCG GTG GGG CCG AGT TTT GCA CAG AGT AGC ACA 
1683 1692 1701 1710 1719 1728 

LAESSLPPSMPPPSEDPLNT 
CTC GCC GAA AGC TCC CTG CCG CCG TCA ATG CCG CCG CCA AGT GAA GAC CCC CTC AAC ACC 
1743 1752 1761 1770 1779 1788 

RENSNLEEFDYKLYKPLLDL 
AGG GAG AAC AGC AAC CTT GAG GAG TTC GAC TAC AAG CTT TAC AAA CCC CTG CTA GAT CTT 
1803 1812 1821 1830 1839 1848 

FVNAPAI LERELSAVNDELQ 
TTC GTC AAC GCA CCC GCA ATC TTG GAA AGA GAA CTG AGC GCC GTT AAT GAC GAG CTT CAG 
1863 1872 1881 1890 1899 1908 

EN MIKLRDCLRLPREERD RA 
GAG AAC ATG ATC AAG CTG CGG GAC TGT CTG CGC CTG CCC AGG GAA GAA AGA GAC AGG GCA 
1923 1932 1941 1950 1959 1968 

REEVKKEKEMLKRRD IAL R*" A 
CGC GAA GAG GTG AAG AAG GAA AAG GAA ATG CTC AAG CGA CGG GAC ATT GCG CTC AGA GCC 
1983 1992 2001 2010 2019 2028 

LQDEHKLYVKKRKEHNLINE 
CTC CAG GAC GAA CAC AAG TTG TAC GTC AAG AAA CGC AAA GAG CAT AAT TTG ATC AAC GAG 
2043 2052 2061 2070 2079 2088 

EIVRAYAEEDDEYEDQLMAQ 
GAA ATC GTT CGC GCT TAT GCT GAA GAA GAC GAT GAG TAC GAG GAT CAG TTA ATG GCG CAG 
2103 2112 2121 2130 2139 2148 
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LDKLDDEVEAIVKSLTRLIV 
CTG GAC AAG TTG GAT GAT GAG GTT GAG GCT ATC GTA AAG AGT CTG ACA AGG CTT ATT GTG 
2163 2172 2181 2190 2199 2208 

AAG I T EKS FDLKKE E E EEEE 
GCG GCG GGG ATC ACG GAG AAG AGC TTT GAC CTA AAG AAG GAG GAG GAA GAG GAG GAG GAG 
2223 2232 2241 2250 2259 2268 

KPI I IATPTPSTRTEAPVLP 
AAG CCG ATC ATC ATA GCG ACT CCG ACA CCT TCG ACG AGG ACC GAG GCC CCG GTT CTG CCG 
2283 2292 2301 2310 2319 2328 

TTEYHNSQQVI LQTQHPAAQ 
ACG ACC GAG TAT CAT AAT TCC CAG CAG GTC ATA TTG CAG ACT CAA CAT CCT GCT GCG CAG 
2343 2352 2361 2370 2379 2388 

QVSHRVPPPPTPSFQTARQT 
CAG GTT TCT CAC CGG GTG CCA CCA CCT CCG ACA CCG AGT TTT CAA ACG GCG CGC CAG ACT 
2403 2412 2421 2430 2439 2448 

PVS YQSRPTNNS FPDI SAEE 
CCG GTG TCA TAT CAG AGC AGA CCG ACC AAC AAC TCC TTT CCT GAT ATC TCG GCG GAA GAA 
2463 2472 2481 2490 2499 2508 

AMMF DKEDPFMEQQHAPASA 
GCC ATG ATG TTC GAT AAA GAA GAC CCC TTC ATG GAA CAA CAG CAC GCC CCG GCC TCT GCT 
2523 2532 2541 2550 2559 2568 

PFQATLPQRNS P FKTAPFKP 
CCC TTC CAG GCC ACC CTT CCC CAG CGC AAC AGC CCT TTC AAA ACC GCC CCG TTC AAG CCA 
2583 2592 2601 2610 2619 2628 

VHGHDYFDDE DDDADLLAAV 
GTC CAC GGC CAC GAT TAC TTT GAC GAT GAA GAC GAC GAT GCC GAC CTC CTG GCA GCA GTA 
2643 2652 2661 2670 2679 2688 

DSAE TYTSTAATTTTNNNNH 
GAC AGC GCC GAG ACG TAT ACT TCT ACG GCC GCC ACC ACC ACC ACC AAC AAC AAC AAT CAC 
2703 2712 2721 2730 ■ 2739 2748 

LRSQSVMSTSTATTI KPRKR 
TTA CGA TCA CAA TCG GTG ATG TCA ACA TCC ACG GCG ACC ACG ATC AAA CCG AGG AAA CGC 
2763 2772 2781 2790 2799 2808 

NENANAKKPKSVHAKLSMPP 
AAC GAA AAT GCC AAT GCC AAG AAG CCC AAG TCC GTA CAT GCA AAG TTA TCG ATG CCG CCC 
2823 2832 2841 2850 2859 2868 

EKMKYAWSNDVRKA L K D R F R 
GAA AAG ATG AAG TAT GCG TGG TCG AAT GAT GTG AGG AAG GCT CTC AAG GAT AGG TTT CGG 
2883 2892 2901 2910 2919 2928 

MSGFRQNQLEAINATLGGK 

ATG TCG GGG TTC AGA CAG AAT CAG TTG GAG GCT ATT AAT GCT ACT TTG GGT GGT A AG GTG 
2943 2952 2961 2970 2979 2988 

AGT TCT CTG TCC TTT ACC TAT CTG GGA GAG ACC AAG AAG GAG AGA GAG AGA GAG AGG AGG 

3003 3012 3021 3030 3039 3048 

DAFVLMPTG 
GGA AGA CGA AAA TGG ACT T TG CTG AC T CTA GA AAG G AT GCC TTT GTG TTG ATG CCG ACT GGT 
3065 3074 3083 " 3092 3101 3110 

GGKS LCYQIiPAVVRS GKTRG 

GGT GGA AAG TCT CTG TGC TAT CAG TTG CCG GCT GTA GTC AGG AGC GGC AAG ACG CGT GGT 
3125 3134 3143 3152 3161 3170 

ITVV I SPLLSLMLPQVNHIiA 

ATC ACA GTC GTC ATC TCC CCT CTG CTA AGT CTG ATG CTG GAT CAA GTC AAC CAT TTG GCA 
3185 3194 3203 3212 3221 3230 
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QQPI TLGKLCDILMGKRKNE 
CAG CAG CCC ATC ACG CTG GGC AAG CTG TGC GAC ATC CTG ATG GGC AAG AGA AAG AAC GAG 
4325 4334 4343 4352 4361 4370 

HGGVCH F G I AKGS T Q R E LQR 
CAC GGT GGC GTG TGT CAC TTT GGT ATC GCC AAG GGG AGC ACG CAG AGG GAG CTG CAG AGG 
4385 4394 4403 4412 4421 4430 

IVLQLNFHKALGEDN IMNGA 
ATC GTG CTG CAG CTG AAT TTC CAC AAG GCG CTG GGC GAG GAC AAT ATC ATG AAT GGG GCG 
4445 4454 4463 4472 4481 4490 

GMPITYYI 

GGG ATG CCT ATT ACC TAC TAT ATT GTG AG T GCT GTC CCG GTT GGT CTT GCA TAT CTG GCT 
4505 4514 4523 4532 4541 4550 

/INSERZIONE 
TGPEAGAYL 
TTG TTG CTT TGC TAA CA C AGC AGC TCG TAC AG ACC GGC CCT GAA GCT GGT GCT TAC CTC 
4564 4573 4582 4591 4600 4609 

YNGKRLMLPVPSNKSVEPPS 
TAC AAT GGC AAG CGG TTG ATG CTG CCA GTT CCC TCA AAC AAG TCC GTC GAA CCC CCG TCT 
4624 4633 4642 4651 4660 4669 

RS KQRSRRVDEDMDE QELST 
CGG TCT AAG CAG CGG AGC CGT CGA GTC GAT GAG GAT ATG GAT GAG CAA GAA CTT TCC ACC 
4684 4693 4702 4711 4720 4729 

LQRP PTSTNVS SPVRATKKR 
CTG CAA CGA CCG CCA ACA TCA ACA AAT GTC TCT TCA CCC GTT CGA GCC ACC AAG AAA CGA 
4744 4753 4762 4771 4780 4789 

SSKKALPTLIADYEEP SSDG 
AGT TCC AAA AAG GCT TTA CCG ACC CTC ATC GCC GAC TAC GAA GAG CCC AGC TCC GAC GGT 
4804 4813 4822 4831 4840 4849 

PHGPLHANGYERDNFVVSDN 
CCT CAC GGT CCT CTC CAC GCC AAC GGC TAT GAG CGC GAT AAC TTT GTC GTA TCC GAT AAT 
4864 4873 4882 4891 4900 4909 

VEPEEEEDAFEPVRPSRRGP 
GTT GAA CCC GAA GAG GAA GAA GAT GCC TTC GAA CCT GTC CGC CCC TCG CGG CGC GGC CCA 
4924 4933 4942 4951 4960 4969 

SSRATRPQHRQTTLYDTLSH 
TCT TCT CGC GCT ACC CGC CCT CAA- CAC CGC CAG ACC ACC CTT TAT GAC ACC CTC TCC CAC 
4984 4993 5002 5011 5020 5029 

T QQ S Q TVS QHLAT L G P P I DA 
ACC CAA CAA TCC CAA ACC GTC TCC CAA CAC CTC GCC ACT TTG GGT CCG CCC ATC GAC GCC 
5044 5053 5062 5071 5080 5089 

RTMHNPRYAQLDEVHQDIVD 
CGC ACC ATG CAT AAC CCC CGC TAC GCC CAG CTT GAC GAG GTC CAC CAG GAT ATT GTC GAT 
5104 5113 5122 5131 5140 5149 

AFVEEVKVFEEDFRNRNHMR 
GCC TTT GTT GAA GAA GTC AAG GTC TTC GAG GAG GAC TTT CGC AAC AGG AAC CAC ATG CGC 
5164 5173 5182 5191 5200 5209 

KPI FT ETQYREMAI RWTRSL 
AAA CCC ATC TTT ACC GAG ACG CAG TAC CGT GAG ATG GCA ATC CGG TGG ACG CGG TCG TTA 
5224 5233 5242 5251 5260 5269 

DAMRAI PDINQDKVDRYGAK 
GAC GCG ATG CGC GCG ATC CCG GAT ATC AAC CAG GAT AAA GTA GAT CGG TAT GGT GCC AAA 
5284 5293 5302 5311 5320 5329 

FI PLVERFWGNYQEMMGGGY 
TTC ATC CCA CTT GTG GAG CGG TTC TGG GGG AAT TAT CAG GAG ATG ATG GGG GGA GGG TAT 
5344 5353 5362 5371 5380 5389 
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DNPAVAGDEDDDE. GPRRTGN 
GAT AAT CCT GCT GTG GCT GGC GAT GAG GAT GAT GAT GAG GGC CCC AGG AGG ACA GGA AAT 
5404 5413 5422 5431 5440 5449 

GKGGNKKGGGGGGGNEVVDL 
GGA AAA GGG GGG AAT AAG AAG GGA GGA GGA GGA GGA GGA GGA AAT GAA GTA GTG GAT TTG 
5464 5473 5482 5491 5500 5509 

ISSDEDEPPARAPSRNAGRG 
ATT AGT AGT GAT GAG GAT GAA CCT CCG GCT CGT GCA CCA TCG CGG AAT GCG GGG CGA GGA 
5524 5533 5542 5551 5560 5569 

KAQSTRGGQIQDKGRAVNRR 
AAG GCA CAG TCG ACA CGT GGG GGA CAA ATC CAA GAT AAA GGC CGA GCA GTC AAC CGC CGC 
5584 5593 5602 5611 5620 5629 

GEPIAEED.EEDYGLSDPDID 
GGA GAA CCC ATC GCC GAA GAA GAC GAA GAA GAC TAC GGG CTA AGC GAC CCC GAT ATC GAC 
5644 5653 5662 5671 5680 5689 

AIDPDAITASDNSDEEDDDD 
GCC ATC GAT CCA GAC GCC ATC ACC GCC TCC GAC AAC TCC GAC GAA GAA GAT GAT GAT GAT 
5704 5713 5722 5731 5740 5749 

DDEDLESSRYFSGSTGPPVS 
GAT GAC GAA GAC CTC GAA TCC TCC CGC TAC TTC TCC GGC TCA ACA GGC CCG CCC GTC TCC 
5764 5773 5782 5791 5800 5809 

KAVQDARLREQLSMYASGGS 
AAA GCC GTG CAG GAT GCT CGA CTC CGT GAA CAA CTT TCC ATG TAC GCC TCC GGC GGC AGC 
5824 5833 5842 5851 5860 5869 

SSKGSYGSGRASGGS S SRAS 
TCT TCG AAA GGT AGC TAC GGC TCA GGG CGC GCA TCA GGA GGA TCT TCG TCG AGA GCG TCG 
5884 5893 5902 5911 5920 5929 

GSGWRGGG AGGKKYYRKKRA 
GGA TCA GGA TGG AGA GGT GGA GGA GCA GGT GGG AAG AAA TAC TAC AGG AAG AAG AGG GCT 
5944 5953 5962 5971 5980 5989 

G S SAAGGGGAGGGGVT KRKA 
GGT TCT TCG GCT GCT GGT GGT GGT GGT GCA GGA GGA GGG GGA GTT ACA AAA CGG AAG GCG 
6004 6013 6022 6031 6040 6049 

SGSGAKTARKRGAS TAPKTT 
AGT GGG AGT GGC GCG AAG ACG GCG AGG AAG AGG GGT GCA TCT ACT GCG CCG AAG ACA ACG 
6064 6073 6082 6091 6100 6109 

TRGGGS GAGS RGGGAGGAGG 
ACG AGA GGG GGA GGA TCT GGA GCT GGG TCT AGA GGA GGC GGT GCT GGT GGT GCT GGT GGT 
6124 6133 6142 6151 6160 6169 

AGAGAGAGGGKRGGGGG GGM 
GCT GGT GCT GGT GCT GGT GCT GGA GGA GGG AAA AGG GGT GGT GGA GGT GGA GGA GGA ATG 
6184 6193 6202 6211 6220 6229 

GGISVMPH. 

GGA GGG ATA AGT GTT ATG CCT CAT TAG CTA TTT TAT AGC ATA TCG CAT TTA TAC AGT GTC 
6244 6253 6262 6271 6280 6289 

TTA TGG AAG GGA GGA GGA GAA GAA GAA GGA TAA GCT GGC ATA AGC TTG AAC CGG CCA GGC 
6304 6313 6322 6331 6340 6349 

CAA AAT GGC CAG AGA GCT CAC CGG GCA ATC GAG CTT GAA ATG AGC TTG ACA TAT TAG GTA 
6364 6373 6382 6391 6400 6409 

TTC CCG AGA ATA TAG CGG GAT TAC AAG GCA CTT ACT TTA CCA AGT CGA AAG GGA CGA GCC 
6424 6433 6442 6451 6460 6469 

AAA TCT ATG GTA CTC GCC AGT TGC GCA ACG TTG AGT TTT ATC ATT CGT GGA GTT TTC ATC 
6484 6493 6502 6511 6520 6529 
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GTG GAG TTT TTA TTA TCA ACT ATT CGT TGT ATA GTT TTC GTT GTA GAT GTT AGT TCC GGA 
6544 6553 6562 6571 6580 6589 

CGA TCA AAA GGG GAA GTG TGG AAC AGA GAA GTC GAA AGG ACA AGC CAA AAT GAC ATG GCA 
6604 6613 6622 6631 6640 6649 

GTG TCC AGT CAG ATA CCC TCC AGA CAA AAC CAG ACA CCA ATA ACA AAC CCT TCA ACC ATA 
6664 6673 6682 6691 6700 6709 

ACA CCA GCA AAG CCA ATC CTT AGG TAC CTA CCT AGG GTA GGG TAG GTC CAG GAA TGT CTT 
6724 6733 6742 6751 6760 6769 

CCC CAA AGG TAC CTC TAC TTA TTC ATG TTA CGC TCC ATC AGT CCC ATC GCT TAG CAT CGC 
6784 6793 6802 6811 6820 6829 

TGC CCG GTT ACC TAT CTC TAC CTC TAC CTC TAC CTC TAC CTC TAC CTC TAC CTC TAC CTC 
6844 6853 6862 6871 6880 6889 

TAT CTC TAC CTC TAC CTC TAC 
6904 6913 
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MAKLSVKNNLP RPHLVSL 
ATG GCG AAG CTC TCA GTC AAG AAC AAC CTG CCA CGG CCG CAC TTG GTC TCC TTG 

9 18 27 36 45 54 

SSSTTGS GSGSASRSASAKH 
TCG TCG TCA ACG ACA GGC TCT GGG TCT GGT TCT GCG TCT AGG TCA GCT TCT GCT AAG CAC 
63 72 81 90 99 108 

GSAGSSTFDHEQHQQHQQQQ 
GGA AGT GCC GGT TCC AGT ACC TTT GAT CAT GAA CAA CAT CAA CAA CAT CAA CAA CAA CAA 
123 132 141 150 159 168 

QQKRQRSQSEARQQQqqqqq 
CAA CAA AAG CGC CAG CGG TCG CAA TCA GAA GCA CGA CAA CAG CAG CAG CAA CAG CAA CAG 
183 192 201 210 219 228 

QQQQQQQQQQAQHHAHSTYA 
CAA CAG CAA CAG CAA CAA CAA CAA CAA CAA GCA CAG CAC CAT GCA CAT TCT ACA TAT GCA 
243 252 261 270 279 288 

QRPQPTPQQRPP QNLLXPAS 
CAA AGA CCC CAA CCC ACC CCC CAA CAA CGA CCA CCC CAA AAC CTA CTG ACA CCT GCT TCA 
303 312 321 330 339 348 

TTGASVGPLQRAYSVS LAAR 
ACC ACT GGT GCC AGC GTC GGC CCG CTC CAA CGC GCA TAC TCG GTT TCA TTA GCT GCG AGA 
363 372 381 390 399 408 

QSPSTNLVRPKTDSPAPHTL 
CAG TCC CCC TCG ACA AAC TTG GTC CGT CCA AAG ACC GAC TCG CCA GCT CCC CAC ACT TTA 
423 432 441 450 459 468 

HLKNKKNL RHPAPTPDSPIV 
CAC CTC AAG AAC AAG AAG AAC CTC CGT CAC CCC GCC CCC ACG CCC GAC AGT CCG ATC GTA 
483 492 501 510 519 528 

DD »IFSDAVDLTEELDHDHD 
GAC GAC GAT ATT TTC TCC GAC GCC GTC GAT CTT ACC GAA GAA CTC GAT CAT GAC CAT GAT 
543 552 561 570 579 588 

LNGKDKDN TDNDNTVASSSL 
CTC AAC GGC AAA GAC AAA GAC AAC ACC GAC AAC GAC AAC ACA GTC GCT TCC AGT TCG CTA 
6 °3 612 621 630 639 648 

IGFGDDKLLWREDFAERAEP 
ATA GGG TTC GGC GAT GAC AAG TTA CTG TGG CGA GAG GAC TTT GCT GAG CGT GCA GAG CCC 
663 672 681 690 699 708 

EHERGGSRPRQVKKRKI SND 
GAA CAT GAA AGA GGT GGG AGC AGG CCT CGC CAG GTC AAG AAA CGG AAG ATA TCG AAT GAC 
7 23 732 741 750 759 768 

YI MK.DEDVSLFDDDGEEDEF 
TAC ATT ATG AAG GAT GAG GAT GTC TCG CTT TTT GAT GAT GAT GGC GAG GAG GAC GAG TTT 
783 792 801 810 819 828 

MDIN ELVQGDRESTPKPKAT 
ATG GAT ATC AAT GAG CTA GTT CAG GGG GAT CGG GAA AGT ACT CCG AAG CCA AAG GCT ACA 
843 852 861 870 879 888 
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SRSVSTRLPPTVSLQRGRSP 
TCG AGG TCT GTC TCG ACG AGG CTG CCG CCT ACA GTA TCG CTG CAA CGG GGT CGG TCT CCT 



903 912 921 930 939 



948 



KRKEASVEKRTTENQQQADR 
AAG AGG AAG GAG GCT TCA GTT GAA AAG CGC ACA ACG GAA AAC CAG CAA CAG GCT GAC AGA 
963 972 981 990 999 



1008 



EDEPSFMSSPDVDNSRRRKS 
GAA GAC GAA CCG TCG TTT ATG TCA AGT CCA GAT GTC GAC AAC TCC CGC AAG CGA AAG TCT 
1023 1032 1041 1050 1059 ~ 



1068 



SGSPT- GLTTPRPQQKQTEEV 
TCT GGA TCG CCC ACA GGT TTA ACG ACG CCA AGA CCC CAG CAG AAG CAA ACG GAA GAG GTC 
1083 1092 1101 1110 1119 H28 

PGTTTAKKPRRSEVMDSEDE 
CCA GGT ACG ACC ACC GCC AAG AAG CCA CGG CGC AGT GAA GTG ATG GAC TCG GAG GAC GAG 
1143 1152 1161 1170 1179 1188 

AFTPLSAGSLPGSAEFFRSG 
GCA TTC ACT CCT CTT TCT GCT GGG TCG CTG CCT GGG AGT GCG GAG TTC TTC AGA AGC GGT 
1203 1212 1221 . 1230 1239 1248 

GTTTRELG LDEDTVMDTPSR 
GGG ACC ACC ACA CGG GAA TTG GGT TTG GAC GAA GAC ACG GTT ATG GAC ACG CCT AGT AGG 



1263 1272 1281 1290 1299 



1308 



p PVESTLPTLESV ESRPPPL 
CCA CCG GTC GAG TCC ACT TTG CCA ACT CTC GAG TCT GTG GAA AGT CGA CCA CCC CCC CTG 



1323 1332 1341 1350 1359 



1368 



PPM DLPSQRKPLEPLNTPRN 
CCG CCC ATG GAT CTA CCA TCA CAG CGA AAA CCG CTA GAG CCG TTG AAC ACT CCG CGC AAC 
1383 1392 1401 1410 1419 ~ ~ " 



1428 



QLLESVERPTQQPSVGPSFA 
CAG CTG CTT GAG TCG GTC GAA AGG CCA ACA CAG CAG CCG TCG GTG GGG CCG AGT TTT GCA 
1443 1452 1461 .1470 1479 1488 

QSSTLAES.SLPPSMPPPSED 
CAG AGT AGC ACA CTC GCC GAA AGC TCC CTG CCG CCG TCA ATG CCG CCG CCA AGT GAA GAC 



1503 1512 1521 1530 " 1539 



1548 



PLN TRENSNLEEFDYKLYKP 
CCC CTC AAC ACC AGG GAG AAC AGC AAC CTT GAG GAG TTC GAC TAC AAG CTT TAC AAA CCC 



1563 1572 1581 1590 1599 



1608 



LLDLFV NAPAILERELSAVN 
CTG CTA GAT CTT TTC GTC AAC GCA CCC GCA ATC TTG GAA AGA GAA CTG AGC GCC GTT AAT 
1623 1632 1641 1650 1659 1668 

DEL QENMIKLRDCLRLPREE 
GAC GAG CTT CAG GAG AAC ATG ATC AAG CTG CGG GAC TGT CTG CGC CTG CCC AGG GAA GAA 
1683 1692 1701 1710 1719 1728 

RDRAREEVKKEKEMLKRRD I 
AGA GAC AGG GCA CGC GAA GAG GTG AAG AAG GAA AAG GAA ATG CTC AAG CGA CGG GAC ATT 
1743 1752 1761 1770 1779 1788 

ALRALQDEHKLYVKKRKEHN 
GCG CTC AGA GCC CTC CAG GAC GAA CAC AAG TTG TAC GTC AAG AAA CGC AAA GAG CAT AAT 
1803 1812 1821 1830 1839 1848 
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LINEEIVRAYAEEDDEYEDQ 
TTG ATC AAC GAG GAA ATC GTT CGC GCT TAT GCT GAA GAA GAC GAT GAG TAC GAG GAT CAG 
1863 1872 1881 1890 1899 1908 

LMAQLDKLDDEVEAI VKSLT 
TTA ATG GCG CAG CTG GAC AAG TTG GAT GAT GAG GTT GAG GCT ATC GTA AAG AGT CTG ACA 
1923 1932 1941 1950 1959 1968 

RLIVAAGITEKSFDLKKEEE 
AGG CTT ATT GTG GCG GCG GGG ATC ACG GAG AAG AGC TTT GAC CTA AAG AAG GAG GAG GAA 
1983 1992 2001 2010 2019 2028 

EEEEKP II IATPTP STRTEA 
GAG GAG GAG GAG AAG CCG ATC ATC ATA GCG ACT CCG ACA CCT TCG ACG AGG ACC GAG GCC 
2043 2052 2061 2070 2079 2088 

PVLPTTEYHNSQQVI LQTQH 
CCG GTT CTG CCG ACG ACC GAG TAT CAT AAT TCC CAG CAG GTC ATA TTG CAG ACT CAA CAT 
2103 2112 2121 2130 2139 2148 
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SEQUENCE LISTING 



<110> Universita degli studi di Roma La Sapienza 
Macino, Giuseppe 
Cogoni, Carlo 

<120> Isolation and characterization of a N. crassa silencing 
gene and uses therof 

<130> PC 

<140> 
<141> 

<160> 3 

<170> Patentln Ver. 2.1 

<210> 1 
<211> 6916 
<212> DNA 

<213> Neurospora crassa 
<400> 1 

ccaccaccac caccaccacc acaacaacca actcaacaac gaactcaaca accaacccca 60 
aacccaacct cgacctcaac ctcaacctca acccttgcga cctcgagata cacaaacaca 120 
tcctcgacaa atgacgcccg acccgctaca cgccaacaga ttgcccccga ggtaggagca 180 
tcgacgcatc aggattcagt tggacttgga gaaggaggag gaggaggaat ggcgaagctc 240 
tcagtcaaga acaacctgcc acggccgcac ttggtctcct tgtcgtcgtc aacgacaggc 300 
tctgggtctg gttctgcgtc taggtcagct tctgctaagc acggaagtgc cggttccagt 360 
acctttgatc atgaacaaca tcaacaacat caacaacaac aacaacaaaa gcgccagcgg 420 
tcgcaatcag aagcacgaca acagcagcag caacagcaac accaacagca acagcaacaa 4 80 
caacaacaac aagcacagca ccatgcacat tctacatatg cacaaagacc ccaacccacc 540 
ccccaacaac gaccacccca aaacctactg acacctgctt caaccactgg tgccagcgtc 600 
ggcccgctcc aacgcgcata ctcggtttca ttagctgcga gacagtcccc ctcgacaaac 660 
ttggtccgtc caaagaccga ctcgccagct ccccacactt tacacctcaa gaacaagaag 720 
aacctccgtc accccgcccc cacgcccgac agtccgatcg tagacgacga tattttctcc 780 
gacgccgtcg atcttaccga agaactcgat catgaccatg auctcaacgg caaagacaaa 840 
gacaacaccg acaacgacaa cacagtcgct tccagttcgc taatagggtt cggcgatgac 900 
aagttactgt ggcgagagga ctttgctgag cgtgcagagc ccgaacatga aagaggtggg 960 
agcaggcctc gccaggtcaa gaaacggaag atatcgaatg actacattat gaaggatgag 1020 
gatgtctcgc tttttgatga tgatggcgag gaggacgagt ttatggatat caatgagcta 1080 
gttcaggggg atcgggaaag tactccgaag ccaaaggcta cancgaggtc tgtcrcgacg 1140 
aggctgccgc ctacagtatc gctgcaacgg ggtcggtctc ctaagaggaa ggaggcttca 1200 
gttgaaaagc gcacaacgga aaaccagcaa caggctgaca gagaagacga accgtcgttt 1260 
atgtcaagtc cagatgtcga caactcccgc aagcgaaagt cttctggatc gcccacaggt 1320 
ttaacgacgc caagacccca gcagaagcaa acggaagagg tcccaggtac gaccaccgcc 1380 
aagaagccac ggcgcagtga agtgatggac tcggaggacg aggcattcac tcctcrttct 1440 



1 



gctgggtcgc tgcctgggag tgcggagttc ttcagaagcg gtgggaccac cacacgggaa 1500 
ttgggtttgg acgaagacac ggttatggac acgcctagta ggccaccggt cgagtccact 1560 
ttgccaactc tcgagtctgt ggaaagtcga ccaccccccc tgccgcccat ggatctacca 1620 
tcacagcgaa aaccgctaga gccgttgaac actccgcgca accagctgct tgagtcggtc 1680 
gaaaggccaa cacagcagcc gtcggtgggg ccgagttttg cacagagtag cacactcgcc 1740 
gaaagctccc tgccgccgtc aatgccgccg ccaagtgaag accccctcaa caccagggag 1800 
aacagcaacc ttgaggagtt cgactacaag ctttacaaac ccctgctaga tcttttcgtc 1860 
aacgcacccg caatcttgga aagagaactg agcgccgtta atgacgagct tcaggagaac 1920 
atgatcaagc tgcgggactg tctgcgcctg cccagggaag aaagagacag ggcacgcgaa 1980 
gaggtgaaga aggaaaagga aatgctcaag cgacgggaca ttgcgctcag agccctccag 2040 
gacgaacaca agttgtacgt caagaaacgc aaagagcata atttgatcaa cgaggaaatc 2100 
gttcgcgctt atgctgaaga agacgatgag tacgaggatc agttaatggc gcagctggac 2160 
aagttggatg atgaggttga ggctatcgta aagagtctga caaggcttat tgtggcggcg 2220 
gggatcacgg agaagagctt tgacctaaag aaggaggagg aagaggagga ggagaagccg 2280 
atcatcatag cgactccgac accttcgacg aggaccgagg ccccggttct gccgacgacc 2340 
gagtatcata attcccagca ggtcatattg cagactcaac atcctgctgc gcagcaggtt 2400 
tctcaccggg tgccaccacc tccgacaccg agttttcaaa cggcgcgcca gactccggtg 24 60 
tcatatcaga gcagaccgac caacaactcc tttcctgata tctcggcgga agaagccatg 2520 
atgttcgata aagaagaccc cttcatggaa caacagcacg ccccggcctc tgctcccttc 2580 
caggccaccc ttccccagcg caacagccct ttcaaaaccg ccccgttcaa gccagtccac 2640 
ggccacgatt actttgacga tgaagacgac gatgccgacc tcctggcagc agtagacagc 2700 
gccgagacgt atacttctac ggccgccacc accaccacca acaacaacaa tcacttacga 27 60 
tcacaatcgg tgatgtcaac atccacggcg accacgatca aaccgaggaa acgcaacgaa 2820 
aatgccaatg ccaagaagcc caagtccgta catgcaaagt tatcgatgcc gcccgaaaag 2880 
atgaagtatg cgtggtcgaa tgatgtgagg aaggctctca aggataggtt tcggatgtcg 294 0 
gggttcagac agaatcagtt ggaggctatt aatgctactt tgggtggtaa ggtgagttct 3000 
ctgtccttta cctatctggg agagaccaag aaggagagag agagagagag gaggggaaga 3060 
cgaaaatgga ctttgctgac tctagaaagg atgcctttgt gttgatgccg actggtggtg 3120 
gaaagtctct gtgctatcag ttgccggctg tagtcaggag cggcaagacg cgtggtatca 318.0 
cagtcgtcat ctcccctctg ctaagtctga tgctggatca agtcaaccat ttggcaaacc 3240 
tgatgatcca agcttacgct ttcaacggag acatgaactc agaaatgcgc cgaatggtgt 3300 
ttcagaagct tgatgctgag catcctgagc atgagctcca actgctctat gtcaccccgg 3360 
agatggtgag caagaaccag acattcgtca acaagatgat ggacctctac cggaggaaaa 3420 
agctggctag aattgttatc gacgaggctc actgcgtcag tcaatggggc catgacttcc 34 80 
gacccgatta caaagctata ggagagtttc gtaagaggtc tcccggagtt ccggtcatgg 3540 
ctttgacagc gacggcaaca cagaacgtca tcttggatgt caagcataac ctggcaatgg 3600 
aggactgcca gactttctcc cagagcttta atcggccgaa cctctactat gaggtcagga 3660 
tgaaggagca gaatctgatt gcccgcatcg cagagttgar caaggagaag tatgacggcc 3720 
agacgggtat catctacaca ttatcaagaa agagtgccga gaacatcgcc aaaaatctcc 3780 
aggaaaaaca ccgcatcaaa gcaaagcact accatgcctc catcaccacc gacgaaaaga 3840 
tcagcgttca acatgaatgg caaaccggcc gagtcaaagt cgtggtagcc accattgcct 3900 
tcggcatggg catcgacaag cctgacgtcc gctttgttar ccaccagcac atccccaagt 3960 
cgctcgaagg ttactatcaa gaaaccggcc gcgccggacc tgacggcaag ccatcggact 4020 
gctacttgta ctttgcctat ggcgacattc aatccctacg tcgtatgatc gccgacggcg 4080 
aaggtgacta cgcgcaaaag gagcgtcagc tacaaatgct caaccgtgtg gtcagctatt 4140 
gcgagtcgca gcacacgtgc cggcgcgaag aagtgctccg ctactttggc gaggagtttg 4200 
actaccggaa gtgtagagac ggatgcgata actgccggaa cggacgcatc tcgaagtcga 4260 
cggagatgag ggattttacg gaaatcgcct tcgccgcgac cgaggtggtg aagagccagc 4 320 
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agcccatcac gctgggcaag ctgtgcgaca 
gtggcgtgtg tcactttggt atcgccaagg 
tgctgcagct gaatttccac aaggcgctgg 
tgcctattac ctactatatt gtgagtgctg 
tgctttgcta acacagcagc tcgtacagac 
tggcaagcgg ttgatgctgc cagttccctc 
taagcagcgg agccgtcgag tcgatgagga 
acgaccgcca acatcaacaa atgtctcttc 
caaaaaggct ttaccgaccc tcatcgccga 
cggtcctctc cacgccaacg gctatgagcg 
acccgaagag gaagaagatg ccttcgaacc 
tcgcgctacc cgccctcaac accgccagac 
acaatcccaa accgtctccc aacacctcgc 
catgcataac ccccgctacg cccagcttga 
tgttgaagaa gtcaaggtct tcgaggagga 
catctttacc gagacgcagt accgtgagat 
gatgcgcgcg atcccggata tcaaccagga 
cccacttgtg gagcggttct gggggaatta 
tcctgctgtg gctggcgatg aggatgatga 
aggggggaat aagaagggag gaggaggagg 
tagtgatgag gatgaacctc cggctcgtgc 
acagtcgaca cgtgggggac aaatccaaga 
acccatcgcc gaagaagacg aagaagacta 
cgatccagac gccatcaccg cctccgacaa 
cgaagacctc gaatcctccc gctacttctc 
cgtgcaggat gctcgactcc gtgaacaact 
gaaaggtagc tacggctcag ggcgcgcatc 
aggatggaga ggtggaggag caggtgggaa 
ttcggctgct ggtggtggtg gtgcaggagg 
gagtggcgcg aagacggcga ggaagagggg 
agggggagga tctggagctg ggtctagagg 
tgctggtgct ggtgctggag gagggaaaag 
gataagtgtt atgcctcatt agctatttta 
gaagggagga ggagaagaag aaggataagc 
tggccagaga gctcaccggg caatcgagct 
gagaatatag cgggattaca aggcacttac 
tatggtactc gccagttgcg caacgttgag 
gtttttatta tcaactattc gttgtatagt 
aaaaggggaa gtgtggaaca gagaagtcga 
cagtcagata ccctccagac aaaaccagac 
agcaaagcca atccttaggt acctacctag 
aaggtacctc tacttattca tgttacgctc 
ggttacctat ctctacctct acctctacct 
ctacctctac ctctac 



tcctgatggg caagagaaag aacgagcacg 4380 
ggagcacgca gagggagctg cagaggatcg 4440 
gcgaggacaa tatcatgaat ggggcgggga 4500 
tcccggttgg tcttgcatat ctggctttgt 4560 
cggccctgaa gctggtgctt acctctacaa 4620 
aaacaagtcc gtcgaacccc cgtctcggtc 4 680 
tatggatgag caagaacttt ccaccctgca 4740 
acccgttcga gccaccaaga aacgaagttc 4800 
ctacgaagag cccagctccg acggtcctca 4860 
cgataacttt gtcgtatccg ataatgttga 4920 
tgtccgcccc tcgcggcgcg gcccatcttc 4980 
caccctttat gacaccctct cccacaccca 5040 
cactttgggt ccgcccatcg acgcccgcac 5100 
cgaggtccac caggatattg tcgatgcctt 5160 
ctttcgcaac aggaaccaca tgcgcaaacc 5220 
ggcaatccgg tggacgcggt cgttagacgc 5280 
taaagtagat cggtatggtg ccaaattcat 5340 
tcaggagatg atggggggag ggtatgataa 5400 
tgagggcccc aggaggacag gaaatggaaa 54 60 
aggaggaaat gaagtagtgg atttgattag 5520 
accatcgcgg aatgcggggc gaggaaaggc 5580 
taaaggccga gcagtcaacc gccgcggaga 5640 
cgggctaagc gaccccgata tcgacgccat 5700 
ctccgacgaa gaagatgatg atgatgatga 5760 
cggctcaaca ggcccgcccg tctccaaagc 5820 
ttccatgtac gcctccggcg gcagctcttc 5880 
aggaggatct tcgtcgagag cgtcgggatc 5940 
gaaatactac aggaagaaga gggctggttc 6000 
agggggagtt acaaaacgga aggcgagtgg 6060 
tgcatctact gcgccgaaga caacgacgag 6120 
aggcggtgct ggtggtgctg gtggtgctgg 6180 
gggtggtgga ggtggaggag gaatgggagg 6240 
tagcatatcg catttataca gtgtcttatg 6300 
tggcataagc ttgaaccggc caggccaaaa 6360 
tgaaatgagc ttgacatatt aggtattccc 6420 
tttaccaagt cgaaagggac gagccaaatc 6480 
ttttatcatt cgtggagttt tcatcgtgga 6540 
tttcgttgta gatgttagtt ccggacgatc 6600 
aaggacaagc caaaatgaca tggcagtgtc 6660 
accaataaca aacccttcaa ccataacacc 6720 
ggtagggtag gtccaggaat gtcttcccca 6780 
catcagtccc atcgcttagc atcgctgccc 6840 
ctacctctac ctctacctct acctctatct 6900 

6916 



<210> 2 
<211> 5868 
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<212> DNA 

<213> Neurospora crassa 

<220> 

<221> CDS 

<222> (1) (5868) 

<400> 2 

atg gcg aag etc tea gtc aag aac aac ctg cca egg ccg cac ttg gtc 4 8 

Met Ala Lys Leu Ser Val Lys Asn Asn Leu Pro Arg Pro His Leu Val 
15 10 15 

tec ttg teg teg tea acg aca ggc tct ggg tct ggt tct gcg tct agg 96 
Ser Leu .Ser Ser Ser Thr Thr Gly Ser Gly Ser Gly Ser Ala Ser Arg 
20 25 30 

tea get tct get aag cac gga agt gee ggt tec agt acc ttt gat cat 144 
Ser Ala Ser Ala Lys His Gly Ser Ala Gly Ser Ser Thr Phe Asp His 
35 40 45 

gaa caa cat caa caa cat caa caa caa caa caa caa aag cgc cag egg 192 
Glu Gin His Gin Gin His Gin Gin Gin Gin Gin Gin Lys Arg Gin Arg 
50 55 60 

teg caa tea gaa gca cga caa cag cag cag caa cag caa cag caa cag 240 
Ser Gin Ser Glu Ala Arg Gin Gin Gin Gin Gin Gin Gin Gin Gin Gin 
65 70 75 80 

caa cag caa caa caa caa caa caa gca cag cac cat gca cat tct aca 288 
Gin Gin Gin Gin Gin Gin Gin Gin Ala Gin His His Ala His Ser Thr 
85 90 95 

tat gca caa aga ccc caa ccc acc ccc caa caa cga cca ccc caa aac 336 
Tyr Ala Gin Arg Pro Gin Pro Thr Pro Gin Gin Arg Pro Pro Gin Asn 
100 105 110 

eta ctg aca cct get tea acc act ggt gee age gtc ggc ccg etc caa 384 
Leu Leu Thr Pro Ala Ser Thr Thr Gly Ala Ser Val Gly Pro Leu Gin 
115 120 125 

cgc gca tac teg gtt tea tta get gcg aga cag tec ccc teg aca aac 4 32 
Arg Ala Tyr Ser Val Ser Leu Ala Ala Arg Gin Ser Pro Ser Thr Asn 
130 135 140 

ttg gtc cgt cca aag acc gac teg cca get ccc cac act tta cac etc 4 80 
Leu Val Arg Pro Lys Thr Asp Ser Pro Ala Pro His Thr Leu His Leu 
145 150 155 160 
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aag aac aag aag aac etc cgt cac ccc gec ccc acg ccc gac agt ccg 528 
Lys Asn Lys Lys Asn Leu Arg His Pro Ala Pro Thr Pro Asp Ser Pro 
165 170 175 

ate gta gac gac gat att ttc tec gac gee gtc gat ctt acc gaa gaa 576 
lie Val Asp Asp Asp lie Phe Ser Asp Ala Val Asp Leu Thr Glu Glu 
180 185 190 

etc gat cat gac cat gat etc aac ggc aaa gac aaa gac aac acc gac 624 
Leu Asp His Asp His Asp Leu Asn Gly Lys Asp Lys Asp Asn Thr Asp 
195 200 205 

aac gac aac aca gtc get tec agt teg eta ata ggg ttc ggc gat gac 672 
Asn Asp Asn Thr Val Ala Ser Ser Ser Leu lie Gly Phe Gly Asp Asp 
210 215 220 

aag tta ctg tgg cga gag gac ttt get gag cgt gca gag ccc gaa cat 720 
Lys Leu Leu Trp Arg Glu Asp Phe Ala Glu Arg Ala Glu Pro Glu His 
225 230 235 240 

gaa aga ggt ggg age agg cct cgc cag gtc aag aaa egg aag ata teg 768 
Glu Arg Gly Gly Ser Arg Pro Arg Gin Val Lys Lys Arg Lys lie Ser 
245 250 255 

aat gac tac att atg aag gat gag gat gtc teg ctt ttt gat gat gat 816 
Asn Asp Tyr lie Met Lys Asp Glu Asp Val Ser Leu Phe Asp Asp Asp 
260 265 270 

ggc gag gag gac gag ttt atg gat ate aat gag eta gtt cag ggg gat 864 
Gly Glu Glu Asp Glu Phe Met Asp lie Asn Glu Leu Val Gin Gly Asp 
275 280 285 

egg gaa agt act ccg aag cca aag get aca teg agg tct gtc teg acg 912 
Arg Glu Ser Thr Pro Lys Pro Lys Ala Thr Ser Arg Ser Val Ser Thr 
290 295 > 300 

agg ctg ccg cct aca gta teg ctg caa egg ggt egg tct cct aag agg 960 
Arg Leu Pro Pro Thr Val Ser Leu Gin Arg Gly Arg Ser Pro Lys Arg 
305 310 315 320 

aag gag get tea gtt gaa aag cgc aca acg gaa aac cag caa cag get 1008 
Lys Glu Ala Ser Val Glu Lys Arg Thr Thr Glu Asn Gin Gin Gin Ala 
325 330 335 

gac aga gaa gac gaa ccg teg ttt atg tea agt cca gat gtc gac aac 1056 
Asp Arg Glu Asp Glu Pro Ser Phe Met Ser Ser Pro Asp Val Asp Asn 
340 345 350 
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tec cgc aag cga aag tct tct gga teg ccc aca ggt tta acg acg cca 1104 
Ser Arg Lys Arg Lys Ser Ser Gly Ser Pro Thr Gly Leu Thr Thr Pro 
355 360 365 

aga ccc cag cag aag caa acg gaa gag gtc cca ggt acg acc acc gec 1152 
Arg Pro Gin Gin Lys Gin Thr Glu Glu Val Pro Gly Thr Thr Thr Ala 
370 * 375 380 

aag aag cca egg cgc agt gaa gtg atg gac teg gag gac gag gca ttc 1200 
Lys Lys Pro Arg Arg Ser Glu Val Met Asp Ser Glu Asp Glu Ala Phe 
385 390 395 400 

act cct ctt tct get ggg teg ctg cct ggg agt gcg gag ttc ttc aga 1248 
Thr Pro Leu Ser Ala Gly Ser Leu Pro Gly Ser Ala Glu Phe Phe Arg 
405 410 415 

age ggt ggg acc acc aca egg gaa ttg ggt ttg gac gaa gac acg gtt 1296 
Ser Gly Gly Thr Thr Thr Arg Glu Leu Gly Leu Asp Glu Asp Thr Val 
420 425 430 

atg gac acg cct agt agg cca ccg gtc gag tec act ttg cca act etc 1344 
Met Asp Thr Pro Ser Arg Pro Pro Val Glu Ser Thr Leu Pro Thr Leu 
435 440 445 

gag tct gtg gaa agt cga cca ccc ccc ctg ccg ccc atg gat eta cca 1392 
Glu Ser Val Glu Ser Arg Pro Pro Pro Leu Pro . Pro Met Asp Leu Pro 
450 455 460 

tea cag cga aaa ccg eta gag ccg ttg aac act ccg cgc aac cag ctg 1440 
Ser Gin Arg Lys Pro Leu Glu Pro Leu Asn Thr Pro Arg Asn Gin Leu 
465 470 475 480 

ctt gag teg gtc gaa agg cca aca cag cag ccg teg gtg ggg ccg agt 14 88 
Leu Glu Ser Val Glu Arg Pro Thr Gin Gin Pro Ser Val Gly Pro Ser 
485 490 495 

ttt gca cag agt age aca etc gee gaa age tec ctg ccg ccg tea atg 1536 
Phe Ala Gin Ser Ser Thr Leu Ala Glu Ser Ser Leu Pro Pro Ser Met 
500 505 510 

ccg ccg cca agt gaa gac ccc etc aac -acc agg gag aac age aac ctt 1584 
Pro Pro Pro Ser Glu Asp Pro Leu Asn Thr Arg Glu Asn Ser Asn Leu 
515 520 525 

gag gag ttc gac tac aag ctt tac aaa ccc ctg eta gat ctt ttc gtc 1632 
Glu Glu Phe Asp Tyr Lys Leu Tyr Lys Pro Leu Leu Asp Leu Phe Val 
530 535 540 
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aac gca ccc gca ate ttg gaa aga gaa ctg age gec gtt aat gac gag 1680 
Asn Ala Pro Ala lie Leu Glu Arg Glu Leu Ser Ala Val Asn Asp Glu 
545 550 555 560 

ctt cag gag aac atg ate aag ctg egg gac tgt ctg cgc ctg ccc agg 1728 
Leu Gin Glu Asn Met lie Lys Leu Arg Asp Cys Leu Arg Leu Pro Arg, 
565 570 575 

gaa gaa aga gac agg gca cgc gaa gag gtg aag aag gaa aag gaa atg 1776 
Glu Glu Arg Asp Arg Ala Arg Glu Glu Val Lys Lys Glu Lys Glu Met 
580 585 590 

etc aag cga egg gac att gcg etc aga gee etc cag gac gaa cac aag 1824 
Leu Lys Arg Arg Asp lie Ala Leu Arg Ala Leu Gin Asp Glu His Lys 
595 600 605 

ttg tac gtc aag aaa cgc aaa gag cat aat ttg ate aac gag gaa ate 1872 
Leu Tyr Val Lys Lys Arg Lys Glu His Asn Leu lie Asn Glu Glu lie 
610 615 620 

gtt cgc get tat get gaa gaa gac gat gag tac gag gat cag tta atg 1920 
Val Arg Ala Tyr Ala Glu Glu Asp Asp Glu Tyr Glu Asp Gin Leu Met 
625 630 635 640 

gcg cag ctg gac aag ttg gat gat gag gtt gag get ate gta aag agt 1968 
Ala Gin Leu Asp Lys Leu Asp Asp Glu Val Glu Ala He Val Lys Ser 
645 650 655 

ctg aca agg ctt att gtg gcg gcg ggg ate acg gag aag age ttt gac 2016 
Leu Thr Arg Leu He Val Ala Ala Gly He Thr Glu Lys Ser Phe Asp 
660 665 670 

eta aag aag gag gag gaa gag gag gag gag aag ccg ate ate ata gcg 2064 
Leu Lys Lys Glu Glu Glu Glu Glu Glu Glu Lys Pro He He He Ala 
675 680 685 

act ccg aca cct teg acg agg ace gag gee ccg gtt ctg ccg acg ace 2112 
Thr Pro Thr Pro Ser Thr Arg Thr Glu Ala Pro Val Leu Pro Thr Thr 
690 695 700 

gag tat cat aat tec cag cag gtc ata ttg cag act caa cat cct get 2160 
Glu Tyr His Asn Ser Gin Gin Val He Leu Gin Thr Gin His Pro Ala 
705 710 715 720 

gcg cag cag gtt tct cac egg gtg cca cca cct ccg aca ccg agt ttt 2208 
Ala Gin Gin Val Ser His Arg Val Pro Pro Pro Pro Thr Pro Ser Phe 
725 730 735 
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caa acg gcg cgc cag act ccg gtg tea tat cag age aga ccg acc aac 2256 
Gin Thr Ala Arg Gin Thr Pro Val Ser Tyr Gin Ser Arg Pro Thr Asn 
740 745 750 

aac tec ttt cct gat ate teg gcg gaa gaa gee atg atg ttc gat aaa 2304 
Asn Ser Phe Pro Asp He Ser Ala Glu Glu Ala Met Met Phe Asp Lys 
755 760 765 

gaa gac ccc ttc atg gaa caa cag cac gec ccg gec tct get ccc ttc 2352 
Glu Asp Pro Phe Met Glu Gin Gin His Ala Pro Ala Ser Ala Pro Phe 
770 775 780 

cag gec acc ctt ccc cag cgc aac age cct ttc aaa acc gec ccg ttc 2400 
Gin Ala Thr Leu Pro Gin Arg Asn Ser Pro Phe Lys Thr Ala Pro Phe 
785 790 795 800 

aag cca gtc cac ggc cac gat tac ttt gac gat gaa gac gac gat gec 2448 
Lys Pro Val His Gly His Asp Tyr Phe Asp Asp Glu Asp Asp Asp Ala 
805 810 815 

gac etc ctg gca gca gta gac age gee gag acg tat act tct acg gee 24 96 
Asp Leu Leu Ala Ala Val Asp Ser Ala Glu Thr Tyr Thr Ser Thr Ala 
820 825 830 

gee acc acc acc acc aac aac aac aat cac tta cga tea caa teg gtg 254 4 
Ala Thr Thr Thr Thr Asn Asn Asn Asn His Leu Arg Ser Gin Ser Val 
835 840 845 

atg tea aca tec acg gcg acc acg ate aaa ccg agg aaa cgc aac gaa 2592 
Met Ser Thr Ser Thr Ala Thr Thr He Lys Pro Arg Lys Arg Asn Glu 
850 855 860 

aat gec aat gee aag aag ccc aag tec gta car gca aag tta teg atg 2640 
Asn Ala Asn Ala Lys Lys Pro Lys Ser Val His Ala Lys Leu Ser Met 
865 870 875 880 

ccg ccc gaa aag atg aag tat gcg tgg teg aat gat gtg agg aag get 2688 
Pro Pro Glu Lys Met Lys Tyr Ala Trp Ser Asn Asp Val Arg Lys Ala 
885 890 895 

etc aag gat agg ttt egg atg teg ggg ttc aga cag aat cag ttg gag 2736 
Leu Lys Asp Arg Phe Arg Met Ser Gly Phe Arg Gin Asn Gin Leu Glu 
900 905 910 

get att aat get act ttg ggt ggt aag gat gee ttt gtg ttg atg ccg 2784 
Ala He Asn Ala Thr Leu Gly Gly Lys Asp Ala Phe Val Leu Met Pro 
915 920 925 
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act ggt ggt gga aag tct ctg tgc tat cag ttg ccg get gta gtc agg 2832 
Thr Gly Gly Gly Lys Ser Leu Cys Tyr Gin Leu Pro Ala Val Val Arg 
930 935 940 

age ggc aag acg cgt ggt ate aca gtc gtc ate tec cct ctg eta agt 2880 
Ser Gly Lys Thr Arg Gly lie Thr Val Val He Ser Pro Leu Leu Ser 
945 950 955 960 

ctg atg ctg gat caa gtc aac cat ttg gca aac ctg atg ate caa get 2928 
Leu Met Leu Asp Gin Val Asn His Leu Ala Asn Leu Met He Gin Ala 
965 970 975 

tac get ttc aac gga gac atg aac tea gaa atg cgc cga atg gtg ttt 2976 
Tyr Ala Phe Asn Gly Asp Met Asn Ser Glu Met Arg Arg Met Val Phe 
980 985 990 

cag aag ctt gat get gag cat cct gag cat gag etc caa ctg etc tat 3024 
Gin Lys Leu Asp Ala Glu His Pro Glu His Glu Leu Gin Leu Leu Tyr 
995 1000 1005 

gtc ace ccg gag atg gtg age aag aac cag aca ttc gtc aac aag atg 3072 
Val Thr Pro Glu Met Val Ser Lys Asn Gin Thr Phe Val Asn Lys Met 
1010 1015 1020 

atg gac etc tac egg agg aaa aag ctg get aga att gtt ate gac gag 3120 
Met Asp Leu Tyr Arg Arg Lys Lys Leu Ala Arg He Val He Asp Glu 
1025 1030 1035 1040 

get cac tgc gtc agt caa tgg ggc cat gac ttc cga ccc gat tac aaa 3168 
Ala His Cys Val Ser Gin Trp Gly His Asp Phe Arg Pro Asp Tyr Lys 
1045 1050 1055 

get ata gga gag ttt cgt aag agg ttt ccc gga gtt ccg gtc atg get 3216 
Ala He Gly Glu Phe Arg Lys Arg Phe Pro Gly Val Pro Val Met Ala 
1060 1065 1070 

ttg aca gcg acg gca aca cag aac gtc ate ttg gat gtc aag cat aac 3264 
Leu Thr Ala Thr Ala Thr Gin Asn Val He Leu Asp Val Lys His Asn 
1075 1080 1085 

ctg gca atg gag gac tgc cag act ttc tec cag age ttt aat egg ccg 3312 
Leu Ala Met Glu Asp Cys Gin Thr Phe Ser Gin Ser Phe Asn Arg Pro 
1090 1095 1100 

aac etc tac tat gag gtc agg atg aag gag cag aat ctg att gee cgc 3360 
Asn Leu Tyr Tyr Glu Val Arg Met Lys Glu Gin Asn Leu He Ala Arg 
1105 1110 1115 1120 
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ate gca gag ttg ate aag gag aag tat gac ggc cag acg ggt ate ate 3408 
lie Ala Glu Leu lie Lys Glu Lys Tyr Asp Gly Gin Thr Gly lie He 
1125 1130 H35 

tac aca tta tea aga aag agt gee gag aac ate gee aaa aat etc cag 3456 
Tyr Thr Leu Ser Arg Lys Ser Ala Glu Asn He Ala Lys Asn Leu Gin 
1140 H45 1150 

gaa aaa cac cgc ate aaa gca aag cac tac cat gec tec ate ace ace 3504 
Glu Lys His Arg He Lys Ala Lys His Tyr His Ala Ser He Thr Thr 
1155 1160 1165 

gac gaa aag ate age gtt caa cat gaa tgg caa acc ggc cga gtc aaa 3552 
Asp Glu Lys He Ser Val Gin His Glu Trp Gin Thr Gly Arg Val Lys 
1170 1175 1180 

gtc gtg gta gee acc att gee ttc ggc atg ggc ate gac aag cct gac 3600 
Val Val Val Ala Thr He Ala Phe Gly Met Gly He Asp Lys Pro Asp 
1185 1190 1195 1200 

gtc cgc ttt gtt ate cac cag cac ate ccc aag teg etc gaa ggt tac 3648 
Val Arg Phe Val He His Gin His He Pro Lys Ser Leu Glu Gly Tyr 
1205 1210 1215 

tat caa gaa acc ggc cgc gec gga cgt gac ggc aag cca teg gac tgc 3696 
Tyr Gin Glu Thr Gly Arg Ala Gly Arg Asp Gly Lys Pro Ser Asp Cys 
1220 1225 1230 

tac ttg tac ttt gee tat ggc gac att caa tec eta cgt cgt atg ate 3744 
Tyr Leu Tyr Phe Ala Tyr Gly Asp He Gin Ser Leu Arg Arg Met He 
1235 1240 1245 

gec gac ggc gaa ggt gac tac gcg caa aag gag cgt cag eta caa atg 3792 
Ala Asp Gly Glu Gly Asp Tyr Ala Gin Lys Glu Arg Gin Leu Gin Met 
1250 1255 1260 

etc aac cgt gtg gtc age tat tgc gag teg cag cac acg tgc egg cgc 3840 
Leu Asn Arg Val Val Ser Tyr Cys Glu Ser Gin His Thr Cys Arg Arg 
1265 1270 1275 1280 

gaa gaa gtg etc cgc tac ttt ggc gag gag ttt gac tac egg aag tgt 3888 
Glu Glu Val Leu Arg Tyr Phe Gly Glu Glu Phe Asp Tyr Arg Lys Cys 
1285 1290 1295 

aga gac gga tgc gat aac tgc egg aac gga cgc ate teg aag teg acg 3936 
Arg Asp Gly Cys Asp Asn Cys Arg Asn Gly Arg He Ser Lys Ser Thr 
1300 1305 1310 
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gag atg agg gat ttt acg gaa ate gec ttc gec gcg ate gag gtg gtg 3984 
Glu Met Arg Asp Phe Thr Glu He Ala Phe Ala Ala He Glu Val Val 
1315 1320 1325 

aag age cag cag ccc ate acg ctg ggc aag ctg tgc gac ate ctg atg 4032 
Lys Ser Gin Gin Pro He Thr Leu Gly Lys Leu Cys Asp He Leu Met 
1330 1335 1340 

ggc aag aga aag aac gag cac ggt ggc gtg tgt cac ttt ggt ate gee 4080 
Gly Lys Arg Lys Asn Glu His Gly Gly Val Cys His Phe Gly He Ala 
1345 1350 1355 1360 

aag ggg age acg cag agg gag ctg cag agg ate gtg ctg cag ctg aat 4128 
Lys Gly Ser Thr Gin Arg Glu Leu Gin Arg He Val Leu Gin Leu Asn 
1365 1370 1375 

ttc cac aag gcg ctg ggc gag gac aat ate atg aaz ggg gcg ggg atg 4176 
Phe His Lys Ala Leu Gly Glu Asp Asn He Met Asn Gly Ala Gly Met 
1380 1385 1390 

cct att ace tac tat att acc ggc cct gaa get ggt get tac etc tac 4224 
Pro He Thr Tyr Tyr He Thr Gly Pro Glu Ala Gly Ala Tyr Leu Tyr 
1395 1400 1405 

aat ggc aag egg ttg atg ctg cca gtt ccc tea aac aag tec gtc gaa 4272 
Asn Gly Lys Arg Leu Met Leu Pro Val Pro Ser Asn Lys Ser Val Glu 
1410 1415 1420 

ccc ccg tct egg tct aag cag egg age cgt cga gtc gat gag gat atg 4320 
Pro Pro Ser Arg Ser Lys Gin Arg Ser Arg Arg Val Asp Glu Asp Met 
1425 1430 1435 1440 

gat gag caa gaa ctt tec acc ctg caa cga ccg cca aca tea aca aat 4368 
Asp Glu Gin Glu Leu Ser Thr Leu Gin Arg Pro Pro Thr Ser Thr Asn 
1445 1450 1455 

gtc tct tea ccc gtt cga gee acc aag aaa cga agt tec aaa aag get 4416 
Val Ser Ser Pro Val Arg Ala Thr Lys Lys Arg Ser Ser Lys Lys Ala 
1460 1465 1470 

tta ccg acc etc ate gec gac tac gaa gag ccc age tec gac ggt cct 4 4 64 
Leu Pro Thr Leu He Ala Asp Tyr Glu Glu Pro Ser Ser Asp Gly Pro 
1475 1480 1485 

cac ggt cct etc cac gee aac ggc tat gag cgc gat aac ttt gtc gta 4512 
His Gly Pro Leu His Ala Asn Gly Tyr Glu Arg Asp Asn Phe Val Val 
1490 1495 1500 
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tec gat aat gtt gaa ccc gaa gag gaa gaa gat gec ttc gaa cct gtc 4560 
Ser Asp Asn Val Glu Pro Glu Glu Glu Glu Asp Ala Phe Glu Pro Val 
1505 1510 1515 1520 

cgc ccc teg egg cgc ggc cca tct tct cgc get ace cgc cct caa cac 4608 
Arg Pro Ser Arg Arg Gly Pro Ser Ser Arg Ala Thr Arg Pro Gin His 
1525 1530 1535 

cgc cag acc ace ctt tat gac acc etc tec cac ace caa caa tec caa 4656 
Arg Gin Thr Thr Leu Tyr Asp Thr Leu Ser His Thr Gin Gin Ser Gin 
1540 1545 1550 

acc gtc tec caa cac etc gec act ttg ggt ccg ccc ate gac gee cgc 4704 
Thr Val Ser Gin His Leu Ala Thr Leu Gly Pro Pro lie Asp Ala Arg 
1555 1560 1565 

acc atg cat aac ccc cgc tac gec cag ctt gac gag gtc cac cag gat 4752 
Thr Met His Asn Pro Arg Tyr Ala Gin Leu Asp Glu Val His Gin Asp 
1570 1575 1580 

att gtc gat gec ttt gtt gaa gaa gtc aag gtc ttc gag gag gac ttt 4800 
He Val Asp Ala Phe Val Glu Glu Val Lys Val Phe Glu Glu Asp Phe 
1585 1590 1595 1600 

cgc aac agg aac cac atg cgc aaa ccc ate ttt acc gag acg cag tac 4848 
Arg Asn Arg Asn His Met Arg Lys Pro He Phe Thr Glu Thr Gin Tyr 
1605 1610 1615 

cgt gag atg gca ate egg tgg acg egg teg tta gac gcg atg cgc gcg 4896 
Arg Glu Met Ala He Arg Trp Thr Arg Ser Leu Asp Ala Met Arg Ala 
1620 1625 1630 

ate ccg gat ate aac cag gat aaa gta gat egg taz ggt gee aaa ttc 494 4 
He Pro Asp He Asn Gin Asp Lys Val Asp Arg Tyr Gly Ala Lys Phe 
1635 1640 1645 

ate cca ctt gtg gag egg ttc tgg ggg aat tat cag gag atg atg ggg 4 992 
He Pro Leu Val Glu Arg Phe Trp Gly Asn Tyr Gin Glu Met Met Gly 
1650 1655 1660 

gga ggg tat gat aat cct get gtg get ggc gat gag gat gat gat gag 504 0 
Gly Gly Tyr Asp Asn Pro Ala Val Ala Gly Asp Glu Asp Asp Asp Glu 
1665 1670 1675 1680 

ggc ccc agg agg aca gga aat gga aaa ggg ggg aat aag aag gga gga 5088 
Gly Pro Arg Arg Thr Gly Asn Gly Lys Gly Gly Asn Lys Lys Gly Gly 
1685 1690 1695 
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gga gga gga gga gga aat gaa gta gtg gat ttg att agt agt gat gag 5136 
Gly Gly Gly Gly Gly Asn Glu Val Val Asp Leu lie Ser Ser Asp Glu 
1700 1705 1710 

gat gaa cct ccg get cgt gca cca teg egg aat gcg ggg cga gga aag 5184 
Asp Glu Pro Pro Ala Arg Ala Pro Ser Arg Asn Ala Gly Arg Gly Lys 
1715 1720 1725 

gca cag teg aca cgt ggg gga caa ate caa gat aaa ggc cga gca gtc 5232 
Ala Gin Ser Thr Arg Gly Gly Gin lie Gin Asp Lys Gly Arg Ala Val 
1730 1735 1740 

aac cgc cgc gga gaa ccc ate gec gaa gaa gac gaa gaa gac tac ggg 5280 
Asn Arg Arg Gly Glu Pro lie Ala Glu Glu Asp Glu Glu Asp Tyr Gly 
1745 1750 1755 1760 

eta age gac ccc gat ate gac gee ate gat cca gac gee ate acc gec 5328 
Leu Ser Asp Pro Asp lie Asp Ala lie Asp Pro Asp Ala lie Thr Ala 
1765 1770 1775 

tec gac aac tec gac gaa gaa gat gat gat gat gat gac gaa gac etc 5376 
Ser Asp Asn Ser Asp Glu Glu Asp Asp Asp Asp Asp Asp Glu Asp Leu 
1780 1785 1790 

gaa tec tec cgc tac ttc tec ggc tea aca ggc ccg ccc gtc tec aaa 5424 
Glu Ser Ser Arg Tyr Phe Ser Gly Ser Thr Gly Pro Pro Val Ser Lys 
1795 1800 1805 

gee gtg cag gat get cga etc cgt gaa caa ctt tec atg tac gee tec 5472 
Ala Val Gin Asp Ala Arg Leu Arg Glu Gin Leu Ser Met Tyr Ala Ser 
1810 1815 1820 

ggc ggc age tct teg aaa ggt age tac ggc tea ggg cgc gca tea gga 5520 
Gly Gly Ser Ser Ser Lys Gly Ser Tyr Gly Ser Gly Arg Ala Ser Gly 
1825 1830 1835 1840 

gga tct teg teg aga gcg teg gga tea gga tgg aga ggt gga gga gca 5568 
Gly Ser Ser Ser Arg Ala Ser Gly Ser Gly Trp Arg Gly Gly Gly Ala 
1845 1850 1855 

ggt ggg aag aaa tac tac agg aag aag agg get ggt tct teg get get 5616 
Gly Gly Lys Lys Tyr Tyr Arg Lys Lys Arg Ala Gly Ser Ser Ala Ala 
1860 1865 1870 

ggt ggt ggt ggt gca gga gga ggg gga gtt aca aaa egg aag gcg agt 5664 
Gly Gly Gly Gly Ala Gly Gly Gly Gly Val Thr Lys Arg Lys Ala Ser 
1875 1880 1885 
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ggg agt ggc gcg aag acg gcg agg aag agg ggt gca tct act gcg ccg 5712 
Gly Ser Gly Ala Lys Thr Ala Arg Lys Arg Gly Ala Ser Thr Ala Pro 
1890 1895 1900 

aag aca acg acg aga ggg gga gga tct gga get ggg tct aga gga ggc 5760 
Lys Thr Thr Thr Arg Gly Gly Gly Ser Gly Ala Gly Ser Arg Gly Gly 
1905 1910 1915 1920 

ggt get ggt ggt get ggt ggt get ggt get ggt get ggt get gga gga 5808 
Gly Ala Gly Gly Ala Gly Gly Ala Gly Ala Gly Ala Gly Ala Gly Gly 
1925 1930 1935 

ggg aaa agg ggt ggt gga ggt gga gga gga atg gga ggg ata agt gtt 5856 
Gly Lys Arg Gly Gly Gly Gly Gly Gly Gly Met Gly Gly He Ser Val 
1940 1945 1950 

atg cct cat tag 5868 
Met Pro His 
1955 



<210> 3 
<211> 1955 
<212> PRT 

<213> Neurospora crassa 
<400> 3 

Met Ala Lys Leu Ser Val Lys Asn Asn Leu Pro Arg Pro His Leu Val 

15 10 15' 

Ser Leu Ser Ser Ser Thr Thr Gly Ser Gly Ser Gly Ser Ala Ser Arg 

20 25 30 

Ser Ala Ser Ala Lys His Gly Ser Ala Gly Ser Ser Thr Phe Asp His 

35 40 45 

Glu Gin His Gin Gin His Gin Gin Gin Gin Gin Gin Lys Arg Gin Arg 

50 55 60 

Ser Gin Ser Glu Ala Arg Gin Gin Gin Gin Gin Gin Gin Gin Gin Gin 
65 70 75 80 

Gin Gin Gin Gin Gin Gin Gin Gin Ala Gin His His Ala His Ser Thr 

85 90 95 

Tyr Ala Gin Arg Pro Gin Pro Thr Pro Gin Gin Arg Pro Pro Gin Asn 

100 105 110 

Leu Leu Thr Pro Ala Ser Thr Thr Gly Ala Ser Val Gly Pro Leu Gin 

115 120 125 

Arg Ala Tyr Ser Val Ser Leu Ala Ala Arg Gin Ser Pro Ser Thr Asn 

130 135 140 

Leu Val Arg Pro Lys Thr Asp Ser Pro Ala Pro His Thr Leu His Leu 
145 150 155 160 

Lys Asn Lys Lys Asn Leu Arg His Pro Ala Pro Thr Pro Asp Ser Pro 
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165 170 175 

He Val Asp Asp Asp He Phe Ser Asp Ala Val Asp Leu Thr Glu Glu 

180 185 190 

Leu Asp His Asp His Asp Leu Asn Gly Lys Asp Lys Asp Asn Thr Asp 

195 200 205 

Asn Asp Asn Thr Val Ala Ser Ser Ser Leu He Gly Phe Gly Asp Asp 

210 215 220 

Lys Leu Leu Trp Arg Glu Asp Phe Ala Glu Arg Ala Glu Pro Glu His 
225 230 235 240 

Glu Arg Gly Gly Ser Arg Pro Arg Gin Val Lys Lys Arg Lys He Ser 

245 250 255 

Asn Asp Tyr He Met Lys Asp Glu Asp Val Ser Leu Phe Asp Asp Asp 

260 265 270 

Gly Glu Glu Asp Glu Phe Met Asp He Asn Glu Leu Val Gin Gly Asp 

275 280 285 

Arg Glu Ser Thr Pro Lys Pro Lys Ala Thr Ser Arg Ser Val Ser Thr 

290 295 300 

Arg Leu Pro Pro Thr Val Ser Leu Gin Arg Gly Arg Ser Pro Lys Arg 
305 310 315 320 

Lys Glu Ala Ser Val Glu Lys Arg Thr Thr Glu Asn Gin Gin Gin Ala 

325 330 335 

Asp Arg Glu Asp Glu Pro Ser Phe Met Ser Ser Pro Asp Val Asp Asn 

340 345 350 

Ser Arg Lys Arg Lys Ser Ser Gly Ser Pro Thr Gly Leu Thr Thr Pro 

355 360 365 

Arg Pro Gin Gin Lys Gin Thr Glu Glu Val Pro Gly Thr Thr Thr Ala 

370 375 380 

Lys Lys Pro Arg Arg Ser Glu Val Met Asp Ser Glu Asp Glu Ala Phe 
385 390 395 400 

Thr Pro Leu Ser Ala Gly Ser Leu Pro Gly Ser Ala Glu Phe Phe Arg 

405 410 415 

Ser Gly Gly Thr Thr Thr Arg Glu Leu Gly Leu Asp Glu Asp Thr Val 

420 425 430 

Met Asp Thr Pro Ser Arg Pro Pro Val Glu Ser Thr Leu Pro Thr Leu 

435 440 445 

Glu Ser Val Glu Ser Arg Pro Pro Pro Leu Pro Pro Met Asp Leu Pro 

450 455 460 

Ser Gin Arg Lys Pro Leu Glu Pro Leu Asn Thr Pro Arg Asn Gin Leu 
465 470 475 480 

Leu Glu Ser Val Glu Arg Pro Thr Gin Gin Pro Ser Val Gly Pro Ser 

485 490 495 

Phe Ala Gin Ser Ser Thr Leu Ala Glu Ser Ser Leu Pro Pro Ser Met 

500 505 510 

Pro Pro Pro Ser Glu Asp Pro Leu Asn Thr Arg Glu Asn Ser Asn Leu 

515 520 525 

Glu Glu Phe Asp Tyr Lys Leu Tyr Lys Pro Leu Leu Asp Leu Phe Val 

530 535 540 

Asn Ala Pro Ala He Leu Glu Arg Glu Leu Ser Ala Val Asn Asp Glu 
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545 550 555 560 

Leu Gin Glu Asn Met lie Lys Leu Arg Asp Cys Leu Arg Leu Pro Arg 

565 570 575 

Glu Glu Arg Asp Arg Ala Arg Glu Glu Val Lys Lys Glu Lys Glu Met 

580 585 590 

Leu Lys Arg Arg Asp lie Ala Leu Arg Ala Leu Gin Asp Glu His Lys 

595 600 605 

Leu Tyr Val Lys Lys Arg Lys Glu His Asn Leu He Asn Glu Glu He 

610 615 620 

Val Arg Ala Tyr Ala Glu Glu Asp Asp Glu Tyr Glu Asp Gin Leu Met 
625 630 635 640 

Ala Gin Leu Asp Lys Leu Asp Asp Glu Val Glu Ala He Val Lys Ser 

645 650 655 

Leu Thr Arg Leu He Val Ala Ala Gly He Thr Glu Lys Ser Phe Asp 

660 665 670 

Leu Lys Lys Glu Glu Glu Glu Glu Glu Glu Lys Pro He He lie Ala 

675 680 685 

Thr Pro Thr Pro Ser Thr Arg Thr Glu Ala Pro Val Leu Pro Thr Thr 

690 695 700 

Glu Tyr His Asn Ser Gin Gin Val He Leu Gin Thr Gin His Pro Ala 
705 710 715 720 

Ala Gin Gin Val Ser His Arg Val Pro Pro Pro Pro Thr Pro Ser Phe 

725 730 735 

Gin Thr Ala Arg Gin Thr Pro Val Ser Tyr Gin Ser Arg Pro Thr Asn 

740 745 750 

Asn Ser Phe Pro Asp He Ser Ala Glu Glu Ala Met Met Phe Asp Lys 

755 760 765 

Glu Asp Pro Phe Met Glu Gin Gin His Ala Pro Ala Ser Ala Pro Phe 

770 775 780 

Gin Ala Thr Leu Pro Gin Arg Asn Ser Pro Phe Lys Thr Ala Pro Phe 
785 790 795 800 

Lys Pro Val His Gly His Asp Tyr Phe Asp Asp Glu Asp Asp Asp Ala 

805 810 815 

Asp Leu Leu Ala Ala Val Asp Ser Ala Glu Thr Tyr Thr Ser Thr Ala 

820 825 830 

Ala Thr Thr Thr Thr Asn Asn Asn Asn His Leu Arg Ser Gin Ser Val 

835 840 845 

Met Ser Thr Ser Thr Ala Thr Thr He Lys Pro Arg Lys Arg Asn Glu 

850 855 860 

Asn Ala Asn Ala Lys Lys Pro Lys Ser Val His Ala Lys Leu Ser Met 
865 870 875 880 

Pro Pro Glu Lys Met Lys Tyr Ala Trp Ser Asn Asp Val Arg Lys Ala 

885 890 895 

Leu Lys Asp Arg Phe Arg Met Ser Gly Phe Arg Gin Asn Gin Leu Glu 

900 905 910 

Ala He Asn Ala Thr Leu Gly Gly Lys Asp Ala Phe Val Leu Met Pro 

915 920 925 

Thr Gly Gly Gly Lys Ser Leu Cys Tyr Gin Leu Pro Ala Val Val Arg 
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930 935 940 

Ser Gly Lys Thr Arg Gly He Thr Val Val He Ser Pro Leu Leu Ser 
945 950 955 960 

Leu Met Leu Asp Gin Val Asn His Leu Ala Asn Leu Met He Gin Ala 

965 970 975 

Tyr Ala Phe Asn Gly Asp Met Asn Ser Glu Met Arg Arg Met Val Phe 

980 985 990 

Gin Lys Leu Asp Ala Glu His Pro Glu His Glu Leu Gin Leu Leu Tyr 

995 1000 1005 

Val Thr Pro Glu Met Val Ser Lys Asn Gin Thr Phe Val Asn Lys Met 

1010 1015 1020 

Met Asp Leu Tyr Arg Arg Lys Lys Leu Ala Arg He Val He Asp Glu 
1025 1030 1035 1040 

Ala His Cys Val Ser Gin Trp Gly His Asp Phe Arg Pro Asp Tyr Lys 

1045 1050 1055 

Ala He Gly Glu Phe Arg Lys Arg Phe Pro Gly Val Pro Val Met Ala 

1060 1065 1070 

Leu Thr Ala Thr Ala Thr Gin Asn Val He Leu Asp Val Lys His Asn 

1075 1080 1085 

Leu Ala Met Glu Asp Cys Gin Thr Phe Ser Gin Ser Phe Asn Arg Pro 

1090 1095 1100 

Asn Leu Tyr Tyr Glu Val Arg Met Lys Glu Gin Asn Leu He Ala Arg 
1105 1110 1115 1120 

He Ala Glu Leu He Lys Glu Lys Tyr Asp Gly Gin Thr Gly He He 

1125 1130 1135 

Tyr Thr Leu Ser Arg Lys Ser Ala Glu Asn He Ala Lys Asn Leu Gin 

1140 1145 1150 

Glu Lys His Arg lie Lys Ala Lys His Tyr His Ala Ser He Thr Thr 

1155 1160 1165 

Asp Glu Lys He Ser Val Gin His Glu Trp Gin Thr Gly Arg Val Lys 

1170 1175 1180 

Val Val Val Ala Thr He Ala Phe Gly Met Gly He Asp Lys Pro Asp 
1185 1190 1195 1200 

Val Arg Phe Val He His Gin His He Pro Lys Ser Leu Glu Gly Tyr 

1205 1210 1215 

Tyr Gin Glu Thr Gly Arg Ala Gly Arg Asp Gly Lys Pro Ser Asp Cys 

1220 1225 1230 

Tyr Leu Tyr Phe Ala Tyr Gly Asp He Gin Ser Leu Arg Arg Met He 

1235 1240 1245 

Ala Asp Gly Glu Gly Asp Tyr Ala Gin Lys Glu Arg Gin Leu Gin Met 

1250 1255 1260 

Leu Asn Arg Val Val Ser Tyr Cys Glu Ser Gin His Thr Cys Arg Arg 
1265 1270 1275 1280 

Glu Glu Val Leu Arg Tyr Phe Gly Glu Glu Phe Asp Tyr Arg Lys Cys 

1285 1290 1295 

Arg Asp Gly Cys Asp Asn Cys Arg Asn Gly Arg He Ser Lys Ser Thr 

1300 1305 1310 

Glu Met Arg Asp Phe Thr Glu He Ala Phe Ala Ala He Glu Val Val 
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1315 1320 1325 

Lys Ser Gin Gin Pro lie Thr Leu Gly Lys Leu Cys Asp lie Leu Met 

1330 1335 1340 

Gly Lys Arg Lys Asn Glu His Gly Gly Val Cys His Phe Gly lie Ala 
1345 1350 1355 1360 

Lys Gly Ser Thr Gin Arg Glu Leu Gin Arg lie Val Leu Gin Leu Asn 

1365 1370 1375 

Phe His Lys Ala Leu Gly Glu Asp Asn lie Met Asn Gly Ala Gly Met 

1380 1385 1390 

Pro lie Thr Tyr Tyr lie Thr Gly Pro Glu Ala Gly Ala Tyr Leu Tyr 

1395 1400 1405 

Asn Gly Lys Arg Leu Met Leu Pro Val Pro Ser Asn Lys Ser Val Glu 

1410 1415 1420 

Pro Pro Ser Arg Ser Lys Gin Arg Ser Arg Arg Val Asp Glu Asp Met 
1425 1430 1435 1440 

Asp Glu Gin Glu Leu Ser Thr Leu Gin Arg Pro Pro Thr Ser Thr Asn 

1445 1450 1455 

Val Ser Ser Pro Val Arg Ala Thr Lys Lys Arg Ser Ser Lys Lys Ala 

1460 1465 1470 

Leu Pro Thr Leu lie Ala Asp Tyr Glu Glu Pro Ser Ser Asp Gly Pro 

1475 1480 1485 

His Gly Pro Leu His Ala Asn Gly Tyr Glu Arg Asp Asn Phe Val Val 

1490 1495 1500 

Ser Asp Asn Val Glu Pro Glu Glu Glu Glu Asp Ala Phe Glu Pro Val 
1505 1510 1515 1520 

Arg Pro Ser Arg Arg Gly Pro Ser Ser Arg Ala Thr Arg Pro Gin His 

1525 1530 1535 

Arg Gin Thr Thr Leu Tyr Asp Thr Leu Ser His Thr Gin Gin Ser Gin 

1540 1545 1550 

Thr Val Ser Gin His Leu Ala Thr Leu Gly Pro Pro lie Asp Ala Arg 

1555 1560 1565 

Thr Met His Asn Pro Arg Tyr Ala Gin Leu Asp Glu Val His Gin Asp 

1570 1575 1580 

lie Val Asp Ala Phe Val Glu Glu Val Lys Val Phe Glu Glu Asp Phe 
1585 1590 1595 1600 

Arg Asn Arg Asn His Met Arg Lys Pro lie Phe Thr Glu Thr Gin Tyr 

1605 1610 1615 

Arg Glu Met Ala He Arg Trp Thr Arg Ser Leu Asp Ala Met Arg Ala 

1620 1625 1630 

lie Pro Asp He Asn Gin Asp Lys Val Asp Arg Tyr Gly Ala Lys Phe 

1635 1640 1645 

He Pro Leu Val Glu Arg Phe Trp Gly Asn Tyr Gin Glu Met Met Gly 

1650 1655 1660 

Gly Gly Tyr Asp Asn Pro Ala Val Ala Gly Asp Glu Asp Asp Asp Glu 
1665 1670 1675 1680 

Gly Pro Arg Arg Thr Gly Asn Gly Lys Gly Gly Asn Lys Lys Gly Gly 

1685 1690 1695 

Gly Gly Gly Gly Gly Asn Glu Val Val Asp Leu He Ser Ser Asp Glu 
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1700 1705 1710 

Asp Glu Pro Pro Ala Arg Ala Pro Ser Arg Asn Ala Gly Arg Gly Lys 

1715 1720 1725 

Ala Gin Ser Thr Arg Gly Gly Gin lie Gin Asp Lys Gly Arg Ala Val 

1730 1735 1740 

Asn Arg Arg Gly Glu Pro lie Ala Glu Glu Asp Glu Glu Asp Tyr Gly 
1745 1750 1755 1760 

Leu Ser Asp Pro Asp lie Asp Ala He Asp Pro Asp Ala He Thr Ala 

1765 1770 1775 

Ser Asp Asn Ser Asp Glu Glu Asp Asp Asp Asp Asp Asp Glu Asp Leu 

1780 1785 1790 

Glu Ser Ser Arg Tyr Phe Ser Gly Ser Thr Gly Pro Pro Val Ser Lys 

1795 1800 1805 

Ala Val Gin Asp Ala Arg Leu Arg Glu Gin Leu Ser Met Tyr Ala Ser 

1810 1815 1820 

Gly Gly Ser Ser Ser Lys Gly Ser Tyr Gly Ser Gly Arg Ala Ser Gly 
1825 1830 1835 1840 

Gly Ser Ser Ser Arg Ala Ser Gly Ser Gly Trp Arg Gly Gly Gly Ala 

1845 1850 1855 

Gly Gly Lys Lys Tyr Tyr Arg Lys Lys Arg Ala Gly Ser Ser Ala Ala 

1860 1865 1870 

Gly Gly Gly Gly Ala Gly Gly Gly Gly Val Thr Lys Arg Lys Ala Ser 

1875 1880 1885 

Gly Ser Gly Ala Lys Thr Ala Arg Lys Arg Gly Ala Ser Thr Ala Pro 

1890 1895 1900 

Lys Thr Thr Thr Arg Gly Gly Gly Ser Gly Ala Gly Ser Arg Gly Gly 
1905 1910 1915 1920 

Gly Ala Gly Gly Ala Gly Gly Ala Gly Ala Gly Ala Gly Ala Gly Gly 

1925 t 1930 1935 

Gly Lys Arg Gly Gly Gly Gly Gly Gly Gly Met Gly Gly He Ser Val 
1940 1945 1950 

Met Pro His 
1955 
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